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BACTERIAL PROMOTERS AND METHODS OF USE 

RELATED APPLICATIONS 

5 This application claims priority to United States Provisional Patent 

Application 60/259434, filed December 27, 2000, by Robert Haselbeck, and entitled 
"BACTERIAL PROMOTERS AND METHODS OF USE" and United States Patent 
Application Serial No. 09/948,993, filed September 6, 2001, by Daniel Tishkoff, and 
entitled "RAPID METHOD FOR REGULTING GENE EXPRESSION", which 

10 claims priority to United States Provisional Patent Application Serial No. 60/230,335, 
filed September 6, 2000, by Daniel Tishkoff, and entitled "RAPID REPLACEMENT 
OF GENOMIC PROMOTERS TO GENERATE STRAINS FOR USE IN A CELL- 
BASED ASSAY FOR ANTIBIOTICS", the disclosures of which are incorporated 
herein by reference in their entirety. 

15 

FIELD OF THE INVENTION 
The invention described herein relates to fusion promoters which comprise one 
or more promoters and one or more operators for directing transcription in Gram- 
positive bacteria. More specifically, some embodiments relate to fusion promoters 
20 comprising a promoter sequence that has been modified so as to alter its transcription 
activity in Gram-positive organisms linked to a xylose operator, cells having 
constructs that include such fusion promoters, and methods of using of these 
manufactures. 

25 BACKGROUND OF THE INVENTION 

Genetic information is expressed in bacteria by a series of steps involving 
transcription of DNA into mRNA and the subsequent translation of the mRNA into 
polypeptides or proteins. The expression of encoded information is initiated at a 
region on the DNA molecule called a "promoter", onto which RNA polymerase 

30 recognizes and, thereby, initiates transcription. Promoters, which usually precede 
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protein-coding nucleotide sequences ("structural genes"), vary in "strength" depending 
on the ability to recognize and bind RNA polymerase in a manner facilitating the 
initiation of transcription. 

The regulation of transcription levels from certain promoters in bacteria can be 

5 facilitated by "operators" that are positioned adjacent to or within the promoter. 
Operator sequences generally function by interacting with modulator proteins, which 
either activate or repress the transcriptional activity of the promoter. Thus, the 
inherent strength of the promoter and the amount of regulation that can be imposed 
upon the promoter by an associated operator sequence interacting with the modulator 

1 0 protein contribute significantly to the control of transcription. 

Current techniques in molecular biology enable the transfer to bacteria of 
exogenous nucleic acids, which one desires to transcribe. Successful transcription of 
the transferred nucleic acid in the host microorganism, however, depends on the 
association of the transferred nucleic acid with a suitable promoter and/or operator 

15 sequence. Frequently, the promoter and/or operator that is naturally associated with 
the nucleic acid to be transcribed will not support desired levels of transcription in the 
new host and, a new promoter and/or operator sequence must be operably linked to the 
nucleic acid. A variety of promoters and operators have been studied in this regard. 
(See e.g„ von Gabain, et aL, Proc. Natl. Acad. Sci. (USA), 76:189-193 (1979); 

20 Casadaban, et aL, J. Molecular Biology, 138:179-207 (1980); West, et aL, Gene. 
9:175-193 (1980); (Chang et al t Nature, 275:615 (1978); Itakura et aL, Science, 
198:1056 (1977); Goeddel et aL, Nature, 281:544 (1979); and (Goeddel et aL, Nucleic 
Acids Res., 8:4057 (1980); and EPO Application Publication No. 0036776). 
However, many of the currently available regulatable promoters are designed for use 

25 in Gram-negative bacteria, such as E. coli. There remains a need for regulatable 
promoters that function in Gram-positive bacteria. In particular, the need exists for 
promoters that can be used to achieve the precise levels of nucleic acid and/or protein 
expression desired for applications such as cell-based gene and drug discovery in 
Gram-positive bacteria. For example, in the discovery of proliferation-required genes, 

30 die expression of a proliferation-required mRNA which is transcibed at a low level 
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may be sufficiently reduced by the levels of antisense transcript produced from a 
promoter having a low maximal level of transcriptional activity to inhibit cell growth. 
However, the expression of a proliferation-required mRNA which is transcribed at 
higher levels may only be sufficiently reduced by levels of antisense transcript that can 

5 produced from a promoter having a high maximal transcriptional activity. As a result, 
the number of types of proliferation-required genes that can be discovered are 
determined in part by the transcriptional activity of the promoter used to express the 
proliferation-inhibiting antisense RNA. Accordingly, there exists a need for a 
promoter suite having a wide dynamic range of transcriptional activities in Gram- 

10 positive bacteria. In methods in which genes required for proliferation of Gram- 
positive organisms are identified by transcribing an antisense nucleic acid which is 
complementary to at least a portion of such genes, the endogenous transcription levels 
of the genes may vary. Accordingly, it is valuable to have a suite of promoters which 
provide a wide range of antisense nucleic acid transcription levels to ensure that a 

15 sufficient amount of antisense nucleic acid is obtained to allow the identification of 
proliferation-required genes which are normally expressed at high levels in the Gram- 
positive organism: In drug discovery applications which utilize Gram-positive cells 
that are sensitized by the expression of proliferation-inhibiting antisense RNAs, the 
degree to which the cells are sensitized is an important factor in determining the 

20 candidate compounds that are discovered. Furthermore, the expression of some 
proliferation-required target genes may be so greatly reduced by the production of 
proliferation-inhibiting antisense RNA from a highly active promoter that, rather than 
becoming sensitized, the cell dies. Accordingly, there exists a need for a promoter 
suite containing a collection of promoters which provide a wide range of basal 

25 transcription levels, maximal transcription levels, and degrees of induction which can 
be used to obtain a level of antisense nucleic acid which is optimal for the 
identification of a particular gene as being required for proliferation or optimal for 
identifying compounds which act on the gene product of a particular gene which is 
required for proliferation in cell-based assays using sensitized cells as described 
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herein. Thus, there remains a need for a suite of promoters having the ability to tightly 
and/or finely regulate transcription in Gram-positive bacteria. 

BRIEF SUMMARY OF THE INVENTION 
5 Some aspects of the present invention are described in the numbered 

paragraphs below. 

1. An isolated nucleic acid comprising a fusion promoter said fusion 
promoter comprising at least one promoter that is modified to have altered activity in 
at least one gram-positive organism said promoter being linked to at least one operator 

10 selected from the group consisting of xylO, tetO, trpO, malO and AclO, wherein said 
at least one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter. 

2. The isolated fusion promoter of Claim 1, wherein said at least one 
promoter is selected from the group consisting of SEQ ID NOs.: 36-45. 

15 3. The isolated nucleic acid of Claim 1, wherein said at least one operator 

isxy/0. 

4. The isolated nucleic acid of Claim 3, wherein said at least one 
promoter is T5. 

5. The isolated nucleic acid of Claim 3, further comprising a second 
20 operator. 

6. The isolated nucleic acid of Claim 5, wherein said second operator is 

lacO. 

7. The isolated nucleic acid of Claim 1, wherein said fusion promoter is 
responsive to an inducer. 

25 8. The isolated nucleic acid of Claim 7, wherein said inducer is xylose. 

9. The isolated nucleic acid of Claim 1, wherein said fusion promoter is 
titratable. 

10. The isolated nucleic acid of Claim 1, wherein said at least one gram- 
positive organism is selected from the group consisting of Bacillus anthracis, 

30 Clostridium Botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
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teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
5 mutans and any species falling within the genera of any of the above species. 

11. The isolated nucleic acid of Claim 1, wherein said at least one gram- 
positive organism is Staphylococcus aureus. 

12. The isolated nucleic acid of Claim 1, wherein said at least one gram- 
positive organism is Enterococcus faecalis. 

10 13. An isolated nucleic acid comprising a fusion promoter said fusion 

promoter comprising at least one promoter selected from the group consisting of T5, 
CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and 2c JO, wherein said 
at least one operator is positioned such that binding of at least one repressor to said at 

1 5 least one operator represses transcription from said fusion promoter. 

14. The isolated nucleic acid of Claim 13, wherein said at least one 
operator is xylO. 

15. The isolated nucleic acid of Claim 14, wherein said at least one 
promoter is T5. 

20 16. The isolated nucleic acid of Claim 14, further comprising a second 

operator. 

17. The isolated nucleic acid of Claim 16, wherein said second operator is 

lacO. 

18. The isolated nucleic acid of Claim 13, wherein said fusion promoter is 
25 responsive to an inducer. 

19. The isolated nucleic acid of Claim 18, wherein said inducer is xylose. 

20. The isolated nucleic acid of Claim 13, wherein said fusion promoter is 
titratable. 

21. The isolated nucleic acid of Claim 13, wherein said at least one gram- 
30 positive organism is selected from the group consisting of Bacillus anthracis, 
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Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
5 epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

22. The isolated nucleic acid of Claim 13, wherein said at least one gram- 
positive organism is Staphylococcus aureus. 

23. The isolated nucleic acid of Claim 13, wherein said at least one gram- 
10 positive organism is Enterococcus faecalis. 

24. An isolated fusion promoter comprising one of SEQ ID NO.: 26-35. 

25. A vector comprising the isolated nucleic acid of Claim 1 . 

26. A vector comprising the isolated nucleic acid of Claim 13. 

27. A vector comprising the isolated nucleic acid of Claim 24. 

15 28. The vector of Claim 25 further comprising at least one replicon 

selected from the group consisting of pl5a, pC194 and pCTl 138. 

29. The vector of Claim 25 further comprising a reporter gene operably 
linked to said fusion promoter. 

30. The vector of Claim 29, wherein said reporter gene is lacL-lacM. 

20 31. The vector of Claim 30, wherein lacL-lacM is derived from 

Leuconostoc mesenteroides. 

32. The vector of Claim 25, wherein said at least one operator is xylO. 

33. The vector of Claim 32, wherein said at least one promoter is T5. 

34. The vector of Claim 32, further comprising a second operator. 
25 35. The vector of Claim 34, wherein said second operator is lacO. 

36. The vector of Claim 25, wherein said fusion promoter is responsive to 
an inducer. 

37. The vector of Claim 36, wherein said inducer is xylose. 

38. The vector of Claim 25, wherein said fusion promoter is titratable. 
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39. The vector of Claim 25, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faeccdis, Enterococcus faecium, 

5 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

40. The vector of Claim 25, wherein said at least one gram-positive 
1 0 organism is Staphylococcus aureus. 

41. The vector of Claim 25, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

42. The vector of Claim 25, further comprising a random fragment of a 
microbial genome operably linked to said fusion promoter. 

15 43. The vector of Claim 25, further comprising a nucleic acid that encodes 

a peptide, wherein said nucleic acid is operably linked to said fusion promoter. 

44. The vector of Claim 25, further comprising a nucleic acid that is 

complementary to a portion of a microbial genome, wherein said nucleic acid is 

operably linked to said fusion promoter. 
20 45. The vector of Claim 25, further comprising a nucleic acid that encodes 

a molecule that inhibits the proliferation of microbe, wherein said nucleic acid is 

operably linked to said fusion promoter. 

46. A host cell comprising the nucleic acid of Claim 1 . 

47. A host cell comprising the nucleic acid of Claim 13 

25 48. A method of expressing a biomolecule in a cell said method 

comprising: 

(a) introducing a construct into a cell said construct comprising a 
fusion promoter operably linked to a nucleic acid encoding a biomolecule, said 
fusion promoter comprising at least one promoter that is modified to have 
30 altered activity in at least one gram-positive organism said promoter being 
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linked to at least one operator selected from die group consisting of xylO t tetO, 
trpO, malO and sLcIO, wherein said at least one operator is positioned such 
that binding of at least one repressor to said at least one operator represses 
transcription from said fusion promoter, and 
5 (b) transcribing said nucleic acid encoding a biomolecule from said 

fusion promoter. 

49. The method of Claim 48, wherein said at least one operator is xylO. 

50. The method of Claim 49, wherein said at least one promoter is T5. 

51. The method of Claim 49, wherein said fusion promoter further 
1 0 comprises a second operator. 

52. The method of Claim 5 1 , wherein said second operator is lacO. 

53. The method of Claim 48, wherein said fusion promoter is responsive to 
an inducer. 

54. The method of Claim 53, wherein said inducer is xylose. 

15 55. The method of Claim 48, wherein said fusion promoter is titratable. 

56. The method of Claim 48, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 

20 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

57. The method of Claim 48, wherein said at least one gram-positive 
25 organism is Staphylococcus aureus. 

58. The method of Claim 48, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

59. The method of Claim 48, wherein said nucleic acid is a random 
fragment of a microbial genome. 
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60. The method of Claim 48, wherein said biomolecule is a peptide or 
nucleic acid. 

61. The method of Claim 48, wherein said biomolecule is a nucleic acid 
that is complementary to a portion of a microbial genome. 

5 62. The method of Claim 48, wherein said biomolecule is a nucleic acid 

that encodes a molecule that inhibits the proliferation of a microbe. 

63. A biomolecule produced by the method of Claim 48. 

64. A method for identifying genes involved in cellular proliferation said 
method comprising the steps of: 

10 (a) introducing into cells of a cell population a construct comprising an 

inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter that is modified to have altered 
activity in at least one gram-positive organism said promoter being linked to at 
least one operator selected from the group consisting of xylO, tetO, trpO, malO 

15 and XclO, wherein said at least one operator is positioned such that binding of 

at least one repressor to said at least one operator represses transcription from 
said fusion promoter, 

(b) inducing transcription of said nucleic acid from said inducible 
fusion promoter, 

20 (c) identifying the cells in said cell population whose proliferation is 

reduced in response to the induction of transcription of said nucleic acid; and 

(d) identifying the gene from a cell identified in step (c) to which at 
least a portion of said nucleic acid is complementary. 

65. A method for identifying genes involved in cellular proliferation said 
25 method comprising the steps of: 

(a) introducing into cells of a cell population a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter selected from the group consisting 
of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least 
30 one operator selected from the group consisting of xylO, tetO, trpO, malO and 
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AclO, wherein said at least one operator is positioned such that binding of at 
least one repressor to said at least one operator represses transcription from 
said fusion promoter, 

(b) inducing transcription of said nucleic acid from said inducible 
5 fusion promoter; 

(c) identifying the cells in said cell population whose proliferation is 
reduced in response to the induction of transcription of said nucleic acid; and 

(d) identifying the gene from a cell identified in step (c) to which at 
least a portion of said nucleic acid is complementary. 

1 o 66. The method of Claim 64, wherein said nucleic acid is random fragment 

of a microbial genome. 

67. The method of Claim 64, wherein said nucleic acid encodes an 
aptamer. 

68. The method of Claim 64, wherein said nucleic acid encodes a transcript 
1 5 that is complementary to a portion of a microbial genome. 

69. The method of Claim 64, wherein said nucleic acid encodes a molecule 
that inhibits the proliferation of a microbe. 

70. The method of Claim 64, wherein said at least one operator is xylO. 

71 . The method of Claim 70, wherein said fusion promoter is T5. 

20 72. The method of Claim 70, wherein said fusion promoter further 

comprises a second operator. 

73. The method of Claim 72, wherein said second operator is lacO. 

74. The method of Claim 64, wherein said inducer is xylose. 

75. The method of Claim 64, wherein said fusion promoter is titratable. 

25 76. The method of Claim 64, wherein said at least one gram-positive 

organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 

30 tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
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epidermidis. Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

77. The method of Claim 64, wherein said at least one gram-positive 
organism is Staphylococcus aureus. 

78. The method of Claim 64, wherein said at least one gram-positive 
organism is Enterococcus faecalis* 

79. A method for identifying genes involved in cellular proliferation said 
method comprising the steps of: 

(a) introducing into the genome of a cell a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter that is modified to have altered 
activity in at least one gram-positive organism said promoter being linked to at 
least one operator selected from the group consisting of xylO, tetO> trpO, malO 
and Acl0 9 wherein said at least one operator is positioned such that binding of 
at least one repressor to said at least one operator represses transcription from 
said fusion promoter; 

(b) comparing the proliferation of said cell cultured in the presence of a 
first concentration of an inducer that induces transcription from said fiision 
promoter with the proliferation of said cell cultured in the presence of a 
concentration of said inducer that is less than said first concentration, wherein 
a difference in proliferation indicates that said fusion promoter modulates a 
gene that is required for proliferation; and 

(c) identifying the gene that is modulated by said fusion promoter. 

80. A method for identifying genes involved in cellular proliferation said 
method comprising the steps of: 

(a) introducing into the genome of a cell a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter selected from the group consisting 
of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least 
one operator selected from the group consisting of xylO, tetO, trpO, malO and 
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XclO, wherein said at least one operator is positioned such that binding of at 
least one repressor to said at least one operator represses transcription from 
said fusion promoter, 

(b) comparing the proliferation of said cell cultured in the presence of a 

5 first concentration of an inducer that induces transcription from said fusion 

promoter with the proliferation of said cell cultured in the presence of a 
concentration of said inducer that is less than said first concentration, wherein 
a difference in proliferation indicates that said fusion promoter modulates a 
gene that is required for proliferation; and 

10 (c) identifying the gene that is modulated by said fusion promoter. 

8 1 . The method of Claim 79, wherein said nucleic acid is random fragment 
of a microbial genome. 

82. The method of Claim 79, wherein said nucleic acid encodes an 
aptamer. 

1 5 83. The method of Claim 79, wherein said nucleic acid encodes a transcript 

that is complementary to a portion of a microbial genome. 

84. The method of Claim 79, wherein said nucleic acid encodes a molecule 
that inhibits the proliferation of a microbe. 

85. The method of Claim 79, wherein said at least one operator is xylO. 
20 86. The method of Claim 85, wherein said at least one promoter is T5. 

87. The method of Claim 85, wherein said fusion promoter further 
comprises a second operator. 

88. The method of Claim 87 wherein said second operator is lacO, 

89. The method of Claim 79, wherein said inducer is xylose. 

25 90. The method of Claim 79, wherein said fusion promoter is titratable. 

91. The method of Claim 79, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 

30 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
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tuberculosis, Nocardia asteroides. Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis> Streptococcus pneumoniae, Streptococcus 
mutatis and any species falling within the genera of any of fee above species. 

92. The method of Claim 79, wherein said at least one gram-positive 
5 organism is Staphylococcus aureus. 

93. The method of Claim 79, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

94. A method for identifying a compound which reduces the activity or 
level of a gene product required for proliferation of a cell said method comprising: 

10 (a) introducing into a cell a construct comprising an inducible fusion 

promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, said fusion promoter comprising at 
least one promoter that is modified to have altered activity in at least one 
gram-positive organism said promoter being linked to at least one operator 

15 selected from the group consisting of xylO, tetO, trpO, malO and Ac70, 

wherein said at least one operator is positioned such that binding of at least one 
repressor to said at least one operator represses transcription from said fusion 
promoter; 

(b) sensitizing said cell by inducing transcription from said fusion 
20 promoter; 

(c) contacting said sensitized cell with a compound; and 

(d) determining the degree to which said compound inhibits 
proliferation of said sensitized cell relative to a cell which has not been 
sensitized. 

25 95. A method for identifying a compound which reduces the activity or 

level of a gene product required for proliferation of a cell said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, said fusion promoter comprising at 
30 least one promoter selected from the group consisting of T5, CP25, P32 f P59, 
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P1P2 and PL, said promoter being linked to at least one operator selected from 
the group consisting of xy 10, tetO, trpO, malO and Ac/O, wherein said at least 
one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter, 
5 (b) sensitizing said cell by inducing transcription from said fusion 

promoter, 

(c) contacting said sensitized cell with a compound; and 

(d) determining the degree to which said compound inhibits 
proliferation of said sensitized cell relative to a cell which has not been 

10 sensitized. 

96. The method of Claim 94, wherein said at least one operator is xylO. 

97. The method of Claim 96, wherein said at least one promoter is T5. 

98. The method of Claim 96, wherein said fusion promoter further 
comprises a second operator. 

15 99. The method of Claim 98, wherein said second operator is lacO. 

100. The method of Claim 94, wherein said inducer is xylose. 

101. The method of Claim 94, wherein said fusion promoter is titratabie. 

102. The method of Claim 94, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 

20 botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosb, Streptococcus pneumoniae. Streptococcus 

25 mutans and any species falling within the genera of any of the above species. 

103. The method of Claim 94, wherein said at least one gram-positive 
organism is Staphylococcus aureus. 

104. The method of Claim 94, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

30 105. A compound identified using the method of Claim 94. 
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106. A method for inhibiting the activity or expression of a gene in an 
operon required for proliferation said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 

5 portion of a proliferation-required operon, said fusion promoter comprising at 

least one promoter that is modified to have altered activity in at least one 
gram-positive organism said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO> trpO, malO and telO y 
wherein said at least one operator is positioned such that binding of at least one 
10 repressor to said at least one operator represses transcription from said fusion 

promoter; and 

(b) inducing transcription from said fusion promoter. 

107. A method for inhibiting the activity or expression of a gene in an 
operon required for proliferation said method comprising: 

15 (a) introducing into a cell a construct comprising an inducible fusion 

promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required operon, said fusion promoter comprising at 
least one promoter selected from the group consisting of T5, CP25, P32, P59, 
P1P2 and PL, said promoter being linked to at least one operator selected from 

20 the group consisting of xylO, tetO, trpO, malO and XclO, wherein said at least 

one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter; and 
(b) inducing transcription from said fusion promoter. 

108. The method of Claim 106, wherein said at least one operator is xylO. 
25 1 09. The method of Claim 108, wherein said at least one promoter is T5. 

110. The method of Claim 108, wherein said fusion promoter further 
comprises a second operator. 

111. The method of Claim 1 10, wherein said second operator is lacO. 

1 1 2. The method of Claim 106, wherein said inducer is xylose. 

30 113. The method of Claim 106, wherein said fusion promoter is titratable. 
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114. The method of Claim 106, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 

5 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

115. The method of Claim 106, wherein said at least one gram-positive 
10 organism is Staphylococcus aureus. 

116. The method of Claim 106, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

117. A method for identifying the biological pathway in which a 
proliferation-required gene or its gene product lies said method comprising: 

15 (a) introducing into a test cell a construct comprising an inducible 

fusion promoter operably linked to a nucleic acid that is complementary to at 
least a portion of a proliferation-required gene, said fusion promoter 
comprising at least one promoter that is modified to have altered activity in at 
least one gram-positive organism said promoter being linked to at least one 

20 operator selected from the group consisting of xylO, tetO, trpO, malO and 

/IclO, wherein said at least one operator is positioned such that binding of at 
least one repressor to said at least one operator represses transcription from 
said fusion promoter; 

(b) inhibiting the activity of said proliferation-required gene or gene 
25 product in said test cell by inducing transcription from said fusion promoter; 

(c) contacting said test cell with a compound known to inhibit growth 
or proliferation of a cell, wherein the biological pathway on which said 
compound acts is known; and 
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(d) determining the degree to which said compound inhibits 
proliferation of said test cell relative to a cell which has not been contacted 
with said compound 

118. A method for identifying the biological pathway in which a 
5 proliferation-required gene or its gene product lies said method comprising: 

(a) introducing into a test cell a construct comprising an inducible 
fusion promoter operably linked to a nucleic acid that is complementary to at 
least a portion of a proliferation-required gene, said fusion promoter 
comprising at least one promoter selected from the group consisting of T5, 
10 CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one 

operator selected from the group consisting of xylO, tetO, trpO, malO and 
AclO, wherein said at least one operator is positioned such that binding of at 
least one repressor to said at least one operator represses transcription from 
said fusion promoter; 

15 (b) inhibiting the activity of said proliferation-required gene or gene 

product in said test cell by inducing transcription from said fusion promoter; 

(c) contacting said test cell with a compound known to inhibit growth 

or proliferation of a cell, wherein the biological pathway on which said 

compound acts is known; and 
20 (d) determining the degree to which said compound inhibits 

proliferation of said test cell relative to a cell which has not been contacted 

with said compound. 

1 1 9. The method of Claim 1 1 7, wherein said at least one operator is xylO, 

120. The method of Claim 1 19, wherein said at least one promoter is T5. 

25 121. The method of Claim 119, wherein said fusion promoter further 

comprises a second operator. 

122. The method of Claim 121, wherein said second operator is lacO. 

123. The method of Claim 1 17, wherein said inducer is xylose. 

1 24. The method of Claim 1 1 7, wherein said fusion promoter is titratable. 
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125. The method of Claim 117, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
5 Lactococcus lactis t Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 



10 organism is Staphylococcus aureus. 

111. The method of Claim 1 17, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

128. A method for determining the biological pathway on which a test 
compound acts said method comprising: 
15 (a) introducing into a ceil a construct comprising an inducible fusion 



126. The method of Claim 117, wherein said at least one gram-positive 



20 



promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, wherein the biological pathway in 
which said gene or product of said gene lies is known, and wherein said fusion 
promoter comprises at least one promoter that is modified to have altered 
activity in at least one gram-positive organism said promoter being linked to at 
least one operator selected from the group consisting of xylO, tetO, trpO, malO 
and XclO, wherein said at least one operator is positioned such that binding of 
at least one repressor to said at least one operator represses transcription from 
said fusion promoter; 



25 



(b) inhibiting the activity of said proliferation-required in said cell by 
inducing transcription from said fusion promoter; 



(c) contacting said cell with a test compound; and 



(d) determining the degree to which said test compound inhibits 
proliferation of said cell relative to a cell containing a which does not contain 



30 



said construct. 
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129. A method for determining the biological pathway on which a test 
compound acts said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 

5 portion of a proliferation-required gene, wherein the biological pathway in 

which said gene or product of said gene lies is known, and wherein said fusion 
promoter comprises at least one promoter selected from the group consisting of 
T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one 
operator selected from the group consisting of xylO, tetO, trpO, malO and 

10 Ac JO, wherein said at least one operator is positioned such that binding of at 

least one repressor to said at least one operator represses transcription from 
said fusion promoter; 

(b) inhibiting the activity of said proliferation-required in said cell by 
inducing transcription from said fusion promoter; 

1 5 (c) contacting said cell with a test compound; and 

(d) determining the degree to which said test compound inhibits 
proliferation of said cell relative to a cell containing a which does not contain 
said construct. 

130. The method of Claim 128, wherein said at least one operator is xylO. 
20 131. The method of Claim 130, wherein said at least one promoter is T5. 

132. The method of Claim 130, wherein said fusion promoter further 
comprises a second operator. 

133. The method of Claim 1 32, wherein said second operator is lacO. 

134. The method of Claim 128, wherein said inducer is xylose. 

25 135. The method of Claim 128, wherein said fusion promoter is titratable. 

136. The method of Claim 128, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 

30 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
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tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis t Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

137. The method of Claim 128, wherein said at least one gram-positive 
5 organism is Staphylococcus aureus. 

138. The method of Claim 128, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

139. A method of manufacturing an antibiotic comprising the steps of: 

(a) introducing into a cell a construct comprising an inducible fusion 
10 promoter operably linked to a nucleic acid that is complementary to at least a 

portion of a proliferation-required gene, said fusion promoter comprising at 
least one promoter that is modified to have altered activity in at least one 
gram-positive organism said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and XclO, 
15 wherein said at least one operator is positioned such that binding of at least one 

repressor to said at least one operator represses transcription from said fusion 
promoter; 

(b) sensitizing said cell by inducing transcription from said fusion 
promoter; 

20 (c) contacting said sensitized cell with a compound; 

(d) identifying a compound which substantially inhibits the 
proliferation of said sensitized cell relative to a cell which has not been 
sensitized; and 

(e) manufacturing the compound so identified. 

25 1 40. A method of manufacturing an antibiotic comprising the steps of: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, said fusion promoter comprising at 
least one promoter selected from the group consisting of T5, CP25, P32, P59, 

30 P1P2 and PL, said promoter being linked to at least one operator selected from 
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the group consisting of xylO, tetO, trpO, malO and ZclO, wherein said at least 
one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter, 

(b) sensitizing said cell by inducing transcription from said fusion 
5 promoter; 

(c) contacting said sensitized cell with a compound; 

(d) identifying a compound which substantially inhibits the 
proliferation of said sensitized cell relative to a cell which has not been 
sensitized; and 

1 0 (e) manufacturing the compound so identified. 

141 . The method of Claim 139, wherein said at least one operator is xylO. 

142. The method of Claim 141, wherein said at least one promoter is T5. 

143. The method of Claim 141, wherein said fusion promoter further 
comprises a second operator. 

15 144. The method of Claim 143, wherein said second operator is lacO. 

US. The method of Claim 139, wherein said inducer is xylose. 

146. The method of Claim 139, wherein said fusion promoter is titratable. 

147. The method of Claim 139, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 

20 botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis. Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 

IS mutans and any species falling within the genera of any of the above species. 

148. The method of Claim 139, wherein said at least one gram-positive 
organism is Staphylococcus aureus. 

149. The method of Claim 139, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 
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150. A method for identifying a gene which is required for proliferation of a 
prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 

5 with a regulatable fusion promoter comprising at least one promoter that is 

modified to have altered activity in at least one gram-positive organism said 
promoter being linked to at least one operator selected from the group 
consisting of xylO, tetO, trpQ, malO, lacO and AclO, wherein said at least one 
operator is positioned such that binding of at least one repressor to said at least 

10 one operator represses transcription from said fusion promoter; and 

(b) identifying cells in which the extent of proliferation of said cell 
when said fusion promoter is active at a first level is substantially different 
than the extent of proliferation of said cell when said fusion promoter is active 
at a second level, said first level being greater than said second level. 

15 1 5 1 . A method for identifying a gene which is required for proliferation of a 

prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 
with a regulatable fusion promoter comprising at least one promoter selected 

20 from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter 

being linked to at least one operator selected from the group consisting of 
xylO, tetO, trpO, malO, lacO and JxlO y wherein said at least one operator is 
positioned such that binding of at least one repressor to said at least one 
operator represses transcription from said fusion promoter; and 

25 (b) identifying cells in which the extent of proliferation of said cell 

when said fusion promoter is active at a first level is substantially different 
than the extent of proliferation of said cell when said fusion promoter is active 
at a second level, said first level being greater than said second level 
152. The method of Claim 150, wherein said fusion promoter is an inducible 

30 promoter. 
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153. The method of Claim 150, wherein the step of replacing said native 
promoter comprises introducing a linear nucleic acid comprising a 5' portion 
homologous to a first portion of said native promoter, a 3' portion homologous to a 
second portion of said native promoter and said fusion promoter disposed between 
said 5' portion and said 3* portion into said cell such that homologous recombination 
occurs between said 5' portion and said first portion of said native promoter and 
between said 3* portion and said second portion of said native promoter. 

154. The method of Claim 153, wherein said linear nucleic acid is double 
stranded. 

155. The method of Claim 153, wherein said linear nucleic acid is single 
stranded. 

156. A method for identifying a compound which inhibits the proliferation 
of a prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 
with a regulatable fusion promoter comprising at least one promoter that is 
modified to have altered activity in at least one gram-positive organism said 
promoter being linked to at least one operator selected from the group 
consisting of xylO, tetO, trpO, malQ, lacO and Ac/0, wherein said at least one 
operator is positioned such that binding of at least one repressor to said at least 
one operator represses transcription from said fusion promoter, and 

(b) comparing the extent of proliferation of a first sample of said cell in 
the presence of said compound to the extent of proliferation of a second 
sample of said cell in the presence of said compound, wherein said first sample 
of said cell has a reduced activity of said fusion promoter activity relative the 
activity of said fusion promoter in said second sample of said cell and wherein 
said compound inhibits the proliferation of said cell if the extent of 
proliferation of said first sample of said cell is substantially less than the extent 
of proliferation of said second sample of said cell. 
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157. A method for identifying a compound which inhibits the proliferation 
of a piokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 

5 with a regulatable fusion promoter comprising at least one promoter selected 

from the group consisting of T5, CP25 f P32, P59, P1P2 and PL, said promoter 
being linked to at least one operator selected from the group consisting of 
xylO, tetO, trpO, malO, lacO and AclO, wherein said at least one operator is 
positioned such that binding of at least one repressor to said at least one 

10 operator represses transcription from said fusion promoter; and 

(b) comparing the extent of proliferation of a first sample of said cell in 
the presence of said compound to the extent of proliferation of a second 
sample of said cell in the presence of said compound, wherein said first sample 
of said cell has a reduced activity of said fusion promoter activity relative the 

15 activity of said fusion promoter in said second sample of said cell and wherein 

said compound inhibits the proliferation of said cell if the extent of 
proliferation of said first sample of said cell is substantially less than the extent 
of proliferation of said second sample of said cell.. 

158. The method of Claim 156, wherein the step of replacing said native 
20 promoter comprises introducing a linear nucleic acid comprising a 5* portion 

homologous to a first portion of said native promoter, a 3' portion homologous to a 
second portion of said native promoter and said fusion promoter disposed between 
said 5' portion and said 3' portion into said cell such that homologous recombination 
occurs between said 5' portion and said first portion of said native promoter and 
25 between said 3 ' portion and said second portion of said native promoter. 

159. The method of Claim 158, wherein said linear nucleic acid is double 
stranded. 

160. The method of Claim 158, wherein said linear nucleic acid is single 
stranded. 
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161. A method for identifying a gene which is required for proliferation of a 
prokaryotic cell said method comprising the steps of: 

(a) introducing at least one operator selected from the group consisting 
of Ay/0, tetO, trpO, malO. lacO and At JO into a prokaryotic cell having an 

5 enhanced frequency of homologous recombination such that said at least one 

operator regulates transcription of a target nucleic acid in the chromosome of 
said cell; and 

(b) identifying cells in which the extent of proliferation of said cell 
when said target nucleic acid is transcribed at a first level is substantially 

10 different than the extent of proliferation of said cell when said target nucleic 

acid is transcribed at a second level, said first level being greater than said 
second level. 

162. The method of Claim 161, wherein the step of introducing said at least 
one operator comprises introducing a linear nucleic acid comprising a 5' portion 

15 homologous to a first portion of the chromosome of said cell, a 3* portion homologous 
to a second portion of the chromosome of said cell and said at least one operator 
disposed between said 5* portion and said 3' portion into said cell such that 
homologous recombination occurs between the 5' portion and said first portion of the 
chromosome of said cell and between the 3' portion and said second portion of the 

20 chromosome of said cell. 

163. The method of Claim 162, wherein said linear nucleic acid is double 
stranded. 

164. The method of Claim 162, wherein said linear nucleic acid is single 
stranded. 

25 165. A method of identifying a compound which inhibits the proliferation of 

a prokaryotic cell said method comprising the steps of: 

(a) obtaining a prokaryotic cell in which transcription of a nucleic acid 
required for proliferation of said cell is regulated by at least one operator 
which has been introduced into the chromosome of said cell said at least one 
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operator selected from the group consisting of xylO, tetO, trpO, malO, lacO 
and^iO; 

(b) sensitizing said cell by growing said cell under conditions in which 
the level of transcription of said gene is lower than that of a wild type cell; 
5 (c) contacting said sensitized cell with said compound; and 

(d) determining the degree to which said compound inhibits the growth 
of said sensitized cell relative to an unsensitized cell. 

1 66. The method of Claim 165, wherein said cell has an enhanced frequency 
of homologous recombination. 
10 167. The method of Claim 165, wherein said linear nucleic acid is single 

stranded. 

168. The method of Claim 165, wherein said linear nucleic acid is double 
stranded. 

169. A method of identifying a nucleic acid sequence having promoter 
1 5 activity in Enterococcus faecalis said method comprising the steps of: 

(a) inserting a candidate nucleic acid into a vector comprising lacL- 
lacM reporter genes such that said candidate nucleic acid is upstream of the 
lacL-lacM reporter genes; 

(b) introducing said vector comprising said candidate nucleic acid into 
20 Enterococcus faecalis; 

(c) detecting expression of the lacL-lacM reporter genes, wherein 
expression of said lacL-lacM reporter genes indicates that said candidate 
nucleic acid sequence has promoter activity. 

170. The method of Claim 169, wherein said detecting step comprises 
25 measuring P-galactosidase activity. 

171. The method of Claim 169, wherein said candidate nucleic acid is a 
promoter modified to increase activity in a gram-positive organism. 

172. The method of Claim 169, wherein said lacL-lacM reporter gene is 
derived from Leuconostoc mesenteroides. 

30 173. The method of Claim 169, wherein said vector is pEPEFl. 
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Definitions 

The following definitions are provided so as to facilitate the understanding of 
the invention as set out herein. 

5 By "activity against a gene product" is meant having the ability to inhibit the 

function or to reduce the level or activity of the gene product in a cell. This includes, 
but is not limited to, inhibiting the enzymatic activity of the gene product or the ability 
of the gene product to interact with other biological molecules required for its activity, 
including inhibiting the gene product's assembly into a multimeric structure. 

10 By "activity against a protein" is meant having the ability to inhibit the 

function or to reduce the level or activity of the protein in a cell This includes, but is 
not limited to, inhibiting the enzymatic activity of the protein or the ability of the 
protein to interact with other biological molecules required for its activity, including 
inhibiting the protein's assembly into a multimeric structure. 

15 By "activity against a nucleic acid" is meant having the ability to inhibit the 

function or to reduce the level or activity of the nucleic acid in a cell. This includes, 
but is not limited to, inhibiting the ability of the nucleic acid interact with other 
biological molecules required for its activity, including inhibiting the nucleic acid's 
assembly into a multimeric structure. 

20 By "activity against a gene" is meant having the ability to inhibit the function 

or expression of the gene in a cell. This includes, but is not limited to, inhibiting the 
ability of the gene to interact with other biological molecules required for its activity. 

By "activity against an operon" is meant having the ability to inhibit the 
function or reduce the level of one or more products of the operon in a cell. This 

25 includes, but is not limited to, inhibiting the enzymatic activity of one or more 
products of the operon or the ability of one or more products of the operon to interact 
with other biological molecules required for its activity. 

By "antibiotic" is meant an agent which inhibits the proliferation of a cell or 
microorganism. 
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The term "aptamer" as used herein refers to a single-stranded or double- 
stranded oligodeoxyribonucleotide, oligoribonucleotide or modified derivatives of 
oligodeoxyribonucleotides or oligoribonucleotides that specifically binds and alters 
the biological function of a target molecule, which can be a protein, peptide and 

5 derivatives thereof. 

By "biological pathway" is meant any discrete cell function or process that is 
carried out by a gene product or a subset of gene products. Biological pathways 
include anabolic, catabolic, enzymatic, biochemical and metabolic pathways as well as 
pathways involved in the production of cellular structures such as cell walls. 

10 Biological pathways that are usually required for proliferation of cells or 
microorganisms include, but are not limited to, cell division, DNA synthesis and 
replication, RNA synthesis (transcription), protein synthesis (translation), protein 
processing, protein transport, fatty acid biosynthesis, electron transport chains, cell 
wall synthesis, cell membrane production, synthesis and maintenance, and the like. 

15 The term "expression" is defined as the production of a sense or antisense RNA 

molecule from a gene, gene fragment, genomic fragment, chromosome, operon or 
portion thereof. Expression can also be used to refer to the process of peptide or 
polypeptide synthesis. 

An "expression vector" is defined as a vehicle by which a ribonucleic acid 

20 (RNA) is transcribed from a nucleic acid carried within the expression vehicle. The 
expression vector can also contain features that permit translation of a protein product 
from the transcribed RNA message expressed from the exogenous nucleic acid carried 
by the expression vector. Accordingly, an expression vector can produce an RNA 
molecule as its sole product or the expression vector can produce a RNA molecule that 

25 is ultimately translated into a protein product 

By "identifying a compound" is meant to screen one or more compounds in a 
collection of compounds such as a combinatorial chemical library, a natural product 
library or other library of chemical compounds or to characterize a single compound 
by testing the compound in a given assay and determining whether it exhibits the 

30 desired activity. 
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By "inducer" is meant an agent, solution or environmental condition which, 
when placed in contact with a cell or microorganism, increases transcription, or 
inhibitor and/or promoter clearance/fidelity, from a desired promoter. 

By "inhibit activity of a gene or gene product" is meant having the ability to 

5 interfere with the function of a gene or gene product in such a way as to decrease 
expression of the gene, in such a way as to reduce the level or activity of a product of 
the gene or in such a way as to inhibit the interaction of the gene or gene product with 
other biological molecules required for its activity. Agents which inhibit the activity 
of a gene include agents that inhibit transcription of the gene, agents that inhibit 

10 processing of the transcript of the gene, agents that reduce the stability of the transcript 
of the gene, and agents that inhibit translation of the mRNA transcribed from the 
gene. In microorganisms, agents which inhibit the activity of a gene can act to 
decrease expression of the operon in which the gene resides or alter the folding or 
processing of operon RNA so as to reduce the level or activity of the gene product. 

15 The gene product can be a non-translated RNA such as ribosomal RNA, a translated 
RNA (mRNA) or the protein product resulting from translation of the gene mRNA. 
Of particular utility to the present invention are antisense RNAs that have activities 
against the operons or genes to which they specifically hybridize. 

As used herein, "lac operator", "lac operator sequence", or "lacO" is intended 

20 to encompass the operator having SED ID NO.: 11 and any other lac operator 
sequences consistent with the intended uses described herein. 

As used herein, "tel operator", "Acl operator sequence", or "AclO" is 
intended to encompass all lambda operators to which the Xcl repressor or variants 
thereof are capable of binding. Such operators include, but are not limited to, AOri 

25 (SEQ ID NO.: 54), XO R2 (SEQ ID NO.: 55), AOrj (SEQ ID NO.: 56), XO u (SEQ ID 
NO.: 57), Wu (SEQ ID NO.: 58), XO u (SEQ ID NO.: 59). 

As used herein, "ma/ operator", "ma/ operator sequence" or "ma/<?" is 
intended to encompass the two mal operators MalMop (SEQ ID NO.: 60) and MalXop 
(SEQ ID NO.: 61), as described in Nieto et al., J, Biol Chem. 272,30860-30865 
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(1997), the disclosure of which is incorporated herein by reference in its entirety, and 
any other mal operator sequences consistent with the intended uses described herein. 

As used herein, "nucleic acid" means DNA, RNA, or modified nucleic acids. 
Thus, the terminology "the nucleic acid of SEQ ID NO: X" or "the nucleic acid 
5 comprising the nucleotide sequence" includes both the DNA sequence of SEQ ID NO: 
X, the complement thereof, and an RNA sequence in which the thymidines in the 
DNA sequence have been substituted with uridines in the RNA sequence and in which 
the deoxyribose backbone of the DNA sequence has been substituted with a ribose 
backbone in the RNA sequence. 

10 As used herein, "polynucleotide" has the same meaning as nucleic acid. 

As used herein, "proliferation-inhibiting" encompasses instances where the 
absence or substantial reduction of a gene transcript and/or gene product completely 
eliminates cell growth as well as instances where the absence of a gene transcript and/or 
gene product merely reduces cell growth. A proliferation-inhibiting antisense nucleic 

15 acid is one that can cause a reduction of a gene transcript and/or gene product that is 
sufficient to reduce or eliminate the growth or viability of the cell or microorganism. 

As used herein, proliferation-required" or Required for proliferation" 
encompasses instances where the absence or substantial reduction of a gene transcript 
and/or gene product completely eliminates cell growth as well as instances where the 

20 absence of a gene transcript and/or gene product merely reduces cell growth. A 
proUferation-required gene or gene family is one where, in the absence or substantial 
reduction of a gene transcript and/or gene product, growth or viability of the cell or 
microorganism is reduced or eliminated. 

As used herein, "sub-lethal" means a concentration of an agent below the 

25 concentration required to inhibit ail cell growth. 

As used herein, "tetracycline analog" or "Tc analog" is intended to include 
compounds which are structurally related to tetracycline and which bind to the Tet 
repressor with a Ka of at least about 10* 6 M Preferably, the tetracycline analog binds 
with an affinity of about 10" 9 M or greater. Examples of such tetracycline analogs 

30 include, but are not limited to, anhydrotetracycline (ate), doxycycline, 
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chlorotetracycline, oxytetracycline and others disclosed by Hlavka and Boothe, "The 
Tetracyclines," in Handbook of Experimental Pharmacology 78, R. K. Blackwood et 
al. (eds.), Springer- Verlag, Berlin, N.Y., 1985; L. A. Mitscher, "The Chemistry of the 
Tetracycline Antibiotics", Medicinal Research 9, Dekker, N.Y., 1978; Noyee 

5 Development Corporation, "Tetracycline Manufacturing Processes" Chemical Process 
Reviews, Park Ridge, N.J., 2 volumes, 1969; R C. Evans, "The Technology of the 
Tetracyclines", Biochemical Reference Series 1, Quadrangle Press, New York, 1968; 
and H. F. Dowling, "Tetracycline", Antibiotic Monographs, no. 3, Medical 
Encyclopedia, New York, 1955. Preferred Tc analog for high-level stimulation of 

10 transcription is anhydrotetracyciine. For use in prokaryotic organisms, a Tc analog can 
be chosen which has reduced antibiotic activity compared to Tc. Examples of such Tc 
analogues are anhydrotetracyciine, epioxytetracycline and cyanotetracycline. 

As used herein, "Tet repressor" is intended to describe a protein occurring in 
nature which represses transcription from tet operator sequences in prokaryotic cells in 

15 the absence of Tc or Tc analog. Tetracycline repressors of the present invention may 
be obtained from any class of tetracycline repressor (e.g., TetR(A), TetR(B), TetR(C), 
TetR(D), TetR(E), TetR(G), TetR(H), TetR(J), TetR(Z), or combinations thereof). The 
nucleotide sequences of Tet repressors of representative members of the A, B, C, D 
and E classes are known (e.g., see Waters, S. H. et al. (1983) Nucl. Acids Res 

20 11:6089-6105, Hillen, W. and Schollmeier, K. (1983) Nucl. Acids Res. 11:525-539 
and Postle, K. et al. (1984) Nucl Acids Res. 12:4849-4863, Unger, B. et al. (1984) 
Gene 31: 103-108, Unger, B. et al. (1984) Nucl Acids Res. 12:7693-7703 and Tovar, 
K. et al. (1988) MoL Gen. Genet 215:76-80, respectively). 

As used herein, "tet operator", "tet operator sequence" or "tetCT is intended to 

25 encompass all classes of tet operator sequences, e.g. (e.g., TetO(A) (SEQ ID NO.: 62), 
TetO(B) (SEQ ID NO.: 63), TetO(C) (SEQ ID NO.: 64), TetO(D) (SEQ ED NO.: 65), 
TetO(E) (SEQ ID NO.: 66), TetO(G), TetO(H), TetO(J), and TetO(Z). Nucleotide 
sequences of these five classes of tet operators are described in Waters, S. H. et al. 
(1983) cited supra, Hillen, W. and Schollenmeier, K. (1983) cited supra, Stuber, D. 
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and Bujard, H. (1981) Proc. NatL Acad. Sci. USA 78:167-171, Unger, B. et al. (1984) 
cited supra and Tovar, K. et at. (1988) cited supra. 

As used herein, "trp operator", "trp operator sequence" or "trpO" is intended 
to encompass all classes of trp operator sequences consistent with the intended uses 
5 described herein. One example is the trp operator of SEQ ID NO.: 67. 

As used herein, u xyl operator", "xyl operator sequence" or "xylOT is intended 
to encompass all classes of xyl operator sequences consistent with the intended uses 
described herein. One example is the xyl operator of SEQ ID NO.: 68. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a sequence map showing the Xyl-T5 fusion promoter (SEQ ID NO.: 
1) showing the following elements: (1) -45 AT Box; (2) -35 RNA polymerase 
recognition site; (3) a 17 bp intervening sequence that can be replaced by a second 
operator; (4) -10 RNA polymerase recognition site; (5) tkexyl operator; (6) multiple 

1 5 cloning sequence; (7 A) first rrnB terminator; and (7B) second rrnB terminator. 

FIG. 2 is a sequence map showing the Xyl-CP25 fusion promoter (SEQ ID 
NO.: 2) showing the following elements: (1) a portion of the xylR gene; (2) -35 RNA 
polymerase recognition site; (3) a 17 bp intervening sequence that can be replaced by a 
second operator; (4) -10 RNA polymerase recognition site; and (5) the xyl operator. 

20 Also shown are recognition sites for the restriction enzymes (a) RsrII and (b) Xhol. 

FIG. 3A illustrates one embodiment of the present promoter replacement 
method in which a linear double stranded promoter replacement construct comprising 
a nucleic acid encoding a selectable marker, a transcriptional terminator, and a 
regulatable fusion promoter is inserted into the chromosome of a cell expressing the 

25 RecE and RecT gene products of the Rac prophage or gene products having analogous 
functions. 

FIG. 3B illustrates one embodiment of the present promoter replacement 
method in which a linear single stranded promoter replacement construct comprising a 
5' portion homologous to a target sequence, a 3' portion homologous to a target 
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sequence and a regulatable fusion promoter is inserted into the chromosome of a cell 

expressing the X Beta and Gam proteins. 

FIG. 4A illustrates one embodiment of the operator insertion method in which 

a linear double stranded regulatory element insertion construct comprising a nucleic 
5 acid encoding a selectable marker, a transcriptional terminator, and an operator is 

inserted into the chromosome of cells expressing the RecE and RecT gene products of 

the Rac prophage or gene products having analogous functions. 

FIG. 4B illustrates one embodiment of the operator insertion method in which 

a linear single stranded regulatory element insertion construct comprising a 5' portion 
10 homologous to a target sequence, a 3 * portion homologous to a target sequence and an 

operator is inserted into the chromosome of a cell expressing the X Beta and Gam 

proteins. 

FIG. 5 shows a plasmid map of the pXyl-T5 P15a Staphylococcus expression 

vector. 

15 FIG. 6 shows a plasmid map for the pEPEF-X series of Enterococcus 

expression vectors. 

FIG. 7 shows a bar graph depicting the ratio of RNA measured from xylose 
induced cells versus non-induced cells. RNA measurements were made in real time 
by RT-PCR and the samples were normalized for loading by using parallel 16S rRNA 
20 controls. Samples were prepared and analyzed in duplicate and the averages are 
plotted. 

FIG. 8 shows a bar graph comparing the induced and uninduced activities of 
the Xyl-CP25, Xyl-P32, Xyl-P59 and Xyl-PlP2 fusion promoters in Enterococcus 
faecalis. The activity of each fusion promoter was determined by measuring the P- 
25 galactosidase activity produced by the expression of lacL-lacM reporter genes 
operably linked downstream of each fusion promoter. 

FIG. 9 shows a bar graph comparing the induced and uninduced activities of 
the Xyl-T5 and Xyl-CP25 fusion promoters in Enterococcus faecalis. A comparison 
of the induced and uninduced activities of the Xyl-T5 fusion promoter in 
30 Staphylococcus aureus and Enterococcus faecalis is also shown. The activity of each 
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fusion promoter was determined by measuring the p-galactosidase activity produced 
by the expression of lacL-lacM reporter genes operably linked downstream of each 
fusion promoter. 

FIG. 10 shows a bar graph depicting the activity of the Xyl-P59 fusion 
5 promoter over a range of xylose concentrations from 0 to 10%. The activity of Xyl- 
P59 was determined by measuring the P-galactosidase activity produced by the 
expression of lacL-lacM reporter genes operably linked downstream of this fusion 
promoter. 

FIG. 1 1 shows a bar graph comparing the induced and uninduced activities of 

10 the Xyl-P59, Xyl-PL, and modified Xyl-PL (Xyl-PLIO) fusion promoters in 
Enterococcus faecalis. The activity of each fusion promoter was determined by 
measuring the p-galactosidase activity produced by the expression of lacL-lacM 
reporter genes operably linked downstream of each fusion promoter. 

FIG. 12 is an IPTG dose response curve in £ coli transformed with an IPTG- 

15 inducible plasmid containing either an antisense clone to the £ coli ribosomal protein 
rpPN (AS-rplW) which is required for protein synthesis and essential for cell 
proliferation, or an antisense clone to the elaD (AS-elaD) gene which is not known to 
be involved in protein synthesis and which is also essential for proliferation. 

FIG. 13A is a tetracycline dose response curve in £ coli transformed with an 

20 IPTG-inducible plasmid containing antisense to rplW (AS-rplW) in the absence (0) or 
presence of IPTG at concentrations that result in 20% and 50% growth inhibition, 

FIG. 13B is a tetracycline dose response curve in £ coli transformed with an 
IPTG-inducible plasmid containing antisense to elaD (AS-e/aD)in the absence (0) or 
presence of IPTG at concentrations that result in 20% and 50% growth inhibition. 

25 FIG. 14 is a graph showing the fold increase in tetracycline sensitivity of £ 

coli transfected with antisense clones to essential ribosomal proteins L23 (AS-rplW) 
and L7/L12 and L10 (AS-rplLrpU). Antisense clones to genes known to not be 
directly involved in protein synthesis, atpB/E (AS-atpB/E ), visC (AS-wC), elaD 
(AS-elaD), yohH (AS-yohH), are much less sensitive to tetracycline. 
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FIG. 15 illustrates the results of an assay in which Staphylococcus aureus cells 
expressing an antisense nucleic acid complementary to the gyrB gene encoding the p 
subunit of gyrase were contacted #ith several antibiotics whose targets were known. 

5 DETAILED DESCRIPTION OF THE INVENTION 

The invention described herein involves the construction and use of fusion 
promoters that can be tightly and/or finely regulated (e.g., with a xyl, tet, mal t lac, trp, 
Acl, or other suitable operators), constructs and cells having these promoters, methods 
of expression of RNAs, peptides and proteins, methods to identify genes essential for 

10 the growth of bacteria, and methods to identify compounds that inhibit bacterial 
growth. 

Some embodiments of this invention contemplate the preparation of several 
different bacterial fusion promoters which contain at least one operator associated 
with a promoter sequence. The at least one operator is positioned such that the 

15 binding of at least one repressor to the operator represses transcription from the 
promoter. The promoter sequence may be a wildtype promoter obtained from a 
bacterium, virus or other microorganism or it may be a promoter sequence that has 
been modified to have altered activity in Gram-positive organisms. One embodiment, 
for example, is a fusion promoter comprising a promoter sequence modified to have 

20 altered activity in at least one Gram-positive organism linked to one or more operators 
including, but not limited to, the xylose operator ("xy/O"), the tetracycline operator 
("tetO"), the maltose operator ('WO"), lactose operator ("lacO"), the tryptophan 
operator ("*r/>0"), and the lambda CI operator {?XclO"). These operators can be 
induced by the following inducers, respectively: xylose or analog thereof, tetracycline 

25 or analog thereof, maltose or analog thereof, lactose or analog thereof, tryptophan or 
analog thereof, and temperature. 

In some embodiments, fusion promoters based on the bacteriophage T5 
promoter and the xylose operator are contemplated. The T5 promoter has been 
shown to be capable of directing the efficient expression of genes primarily in 

30 Escherichia coli. (See e.g„ Gentz et aL, Proc. NatL Acad. Sci. USA 78:4936-4940 
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(1981); Rommens et aL, Nucleic Acids Res. 11:5921-5940 (1983); Wang et at, 
Gene 242:105-114 (2000); and U.S. Pat Nos. 4,495,280 and 4,689,406) and the 
xylose operator/repressor system has been shown to regulate expression in some 
Gram-positive bacteria. (See e.g., Dahl et aL, J. MoL BioL 243:413-424 (1994); 
5 Sizemore et aL, J. BacterioL 174:3042-3048 (1992); Wieland et aL, Gene 158:91- 
96 (1995); Kreuzer et aL, J. BacterioL 171:3840-3845 (1989); Lokman et aL, J. 
BacterioL 179:5391-5397 (1997); and Erbeznik et aL, J. BacterioL 180:1103-1109 
(1998)). 

In a preferred embodiment, xylO is operably linked to a T5 promoter such that 

10 transcription from the fusion promoter is inducible by agents that inhibit the binding 
of the xylose repressor (encoded by xylR) to the xylose operator (xylO). The xylO can 
be the xylO from S. xylosis or B. subtilis or a number of other bacteria. This 
embodiment, referred to as the Xyl-T5 promoter, can also contain several other 
elements including, but not limited to, AT-rich boxes (AT Box) that enhance the 

15 efficiency of gene expression in Gram-positive bacteria. The use of an AT-Box- 
enchanced T5 promoter has been described by D.J. Henner in "Expression of 
Heterologous Genes in Bacillus subtilis" Methods EnzymoL, 195:199 (1990), the 
disclosure of which is incorporated herein by reference in its entirety. 

In some embodiments fusion promoters containing one or more AT Boxes are 

20 contemplated. The AT Box is positioned such that it increases the level of 
transcription from the fusion promoter in Gram-positive bacteria. For the Xyl-T5 
embodiment, a consensus -45 AT Box is shown in FIG. 1 at position (1). Many 
variations of the AT Box can be used. For example, the AT Box can have more or 
less adenine or thymine nucleotides or different sequences or combinations of adenine 

25 and thymine molecules. In this respect, the diversity of the AT box is limited only by 
the ability of this AT rich region to facilitate transcription in Gram-positive bacteria. 
In other embodiments of the present invention, fusion promoters constructed from T5 
promoter derivatives which have been modified to have altered activity in Gram- 
positive organisms are contemplated. 
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A preferred embodiment of the Xyi-T5 fusion promoter is shown in FIG.l. In 
italics, at positions (2) and (4) of FIG. I are shown the -35 and -10 RNA polymerase 
recognition sites (-35 Box and -10 Box), respectively. The 17 nucleotide long element 
shown at position (3) is a site in which a second operator can be inserted if desired. 

5 The second operator is positioned such that the binding of a repressor to the second 
operator represses transcription from the promoter. In some embodiments, the second 
operator may be the xylO, tetO, malO, lacO, trpO, or XclO. At position (5) is shown 
the xyl operator (xylO), which is placed just downstream of the -10 Box. The distance 
between xylO and the T5 promoter can be made shorter or longer provided the ability 

10 to sustain repression of the T5 promoter is maintained. In some embodiments, a 
multiple cloning sequence (designated at position (6)) is also inserted downstream of 
the xyl operator to allow nucleic acids to be transcribed from the promoter when 
operably linked to the promoter. Many different multiple cloning sequences can be 
inserted at position (6) and the multiple cloning sequence shown is only one example. 

15 Additionally, some embodiments can include a terminator sequence, to facilitate 
termination of the RNA transcribed from the promoter at a desired position (e.g., the 
first rrnB terminator shown at position (7 A) and/or the second rrnB terminator shown 
at position (7B)). Preferably, the terminator sequence allows for Rho independent 
transcription termination. 

20 The present invention also contemplates fusion promoters based on promoter 

sequences discovered in Lactococcus lactis linked to at least one operator. For 
example, the Lactococcus lactis promoters CP25 (Jensen and Hammer, AppL Environ. 
Microbiol 64:82-87 (1998), the disclosure of which is incorporated herein by 
reference in its entirety), P32 and P59 (Bruberg et ai., AppL Microbiol Biotechnol 

25 42:108-1 15 (1994), the disclosure of which is incorporated herein by reference in its 
entirety), P1P2 (Nilsson and Johansen, Biochimica Biophysica Acta, 1219:141-144 
(1994), the disclosure of which is incorporated herein by reference in its entirety), and 
PL (Madsen et al, Appl Environ. Microbiol 67:1 128-1 139, the disclosure of which is 
incorporated herein by reference in its entirety) can be fused to to at least one operator. 

30 In some embodiments of the present invention, fusion promoters constructed from 
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derivatives of the above Lactococcus lactis sequences which have been modified to 
have altered activity in Gram-positive organisms are contemplated. 

It will be appreciated that a regulatable fixsion promoter suite can be 
constructed which provides a wide dynamic range of both basal and maximal 

5 transcriptional activities. Each fusion promoter within this suite can be tightly and 
finely regulated over its own characteristic range of transcriptional activity. 
Furthermore, the regulatable fusion promoters within the suite can comprise different 
promoters operably linked to the same operators thus providing for a common means 
of regulation of the promoters within the suite. In some embodiments, the promoters 

10 within the set of regulatable fusion promoters within the suite can be linked to at least 
one operator contained in a promoterless expression vector that is capable of 
replicating in one or more Gram-positive organisms. Accordingly, a regulatable 
fusion promoter having an appropriate basal level and/or maximal level of 
transcriptional activity can be selected from the suite for use in the appropriate 

15 application. 

In some embodiments, fusion promoters based on promoter sequences 
discovered in Lactococcus actively promote transciption in Enterococcus faecalis and 
other Gram-positive organisms. One embodiment is the Xyl-CP25 fusion promoter 
shown in FIG. 2. Position (1) of FIG. 2 shows a portion of the xylR gene located 

20 upstream from the fusion promoter. Positions (2) and (4) show the -35 and -10 Boxes 
of the CP25 promoter sequence, respectively. The 17 nucleotide long element shown 
at position (3) is a site in which a second operator can be inserted if desired. The 
second operator is positioned such that the binding of a repressor to the second 
operator represses transcription from the promoter. In some embodiments, the second 

25 operator may be the xylO, tetO, malO, lacO, trpO, or telO operator. At position (5) 
is shown the xyl operator (xylO), which is placed just downstream of the -10 Box. The 
distance between xylO and the CP25 promoter can be made shorter or longer provided 
the ability to sustain repression of the CP25 promoter is maintained. The positions of 
unique Rsrll and Xhol recognition sites flanking the promoter are indicated as (a) and 

30 (b), respectively. In some embodiments, the Rsrll and Xhol sites can be used to 
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exchange the CP25 promoter with other promoter sequences, such as promoters 
selected from SEQ ID NOs.: 36 - 45. 

In other embodiments of the fiision promoters described herein, elements such 
as multiple cloning sites (MCS) and transcription terminators can be linked 

5 downstream. Preferably, the terminator sequence allows for Rho independent 
transcription termination. For example the rrnB terminator {See e.g, Orosz et al, 
Eur J. Biochem., 201:653-659 (1991), the disclosure of which is incorporated by 
reference in its entirety) may be utilized. 

Some fusion promoters of the present invention function in Staphylcoccus 

10 aureus and Enterococcus faecalis. Other fusion promoters described herein function 
in Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, 
Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium 
diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 

15 asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 

In another aspect of the invention, two or more operators are linked to a 
promoter. The operators are positioned such that binding of repressors to the 

20 operators represses transcription from the promoter. The operators may be identical or 
different. By controlling gene expression with multiple operators (e.g., using a 
plurality of operators that can be selectively modulated in specific organisms), genetic 
information can be safely transferred between bacterial species and expression can be 
selectively induced and regulated. For example, repressors that bind to one or more of 

25 the operators may be expressed in a first bacterial species or strain but not in a second 
species. Advantageously, some embodiments maintain tight repression of a 
potentially lethal transcript in a first species or strain that is used to amplify the 
construct but the repression can be efficiently removed in a second species or strain in 
which transcription from the promoter is desired. 
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One embodiment, for example, concerns a fusion promoter that selectively 
enables the repression of transcription in Escherichia coli and Staphylococcus species. 
The M XyI/Lac-T5 n promoter comprises the lacO operably linked to the Xyl-T5 
promoter so that transcription of the genetic information associated with the promoter 

5 can be induced in Staphylococcus by adding xylose or an analog thereof and/or in 
Escherichia coli by adding lactose or an analog thereof (e.g., IPTG). This selective 
regulation may be a consequence of the fact that Staphylococcus does not express the 
lac repressor that binds to the lacO while Escherichia coli does not express the xylR 
that binds to the xylO. In this manner, the construct can be efficiently amplified in 

10 Escherichia coli and retain the ability to express the desired transcript in 
Staphylococcus once the binding of xylR to xylO is inhibited by adding an inducer 
such as xylose or an analog thereof. 

Similar multioperator constructs, which incorporate both lacO and xylO so as 
to provide regulatable expression in both Escherichia coli and Gram-positive 

15 organisms, can be made using promoter sequences described herein, including 
promoters selected from SEQ ID NOs.: 26 - 35. 

By following the approaches described in the examples below, with 
modifications as would be apparent to one of skill in the art, the preparation of a 
fusion promoter comprising operators other than those described in the examples can 

20 be accomplished without undue burden or extensive experimentation. 

The present invention also contemplates fusion promoters that have been 
modified to have altered activity in Gram-positive organisms. Modifications to fusion 
promoters can be made, for example, by synthesizing the modified DNA sequences, 
PCR based mutagenesis, site-directed mutagenesis techniques, DNA shuffling or any 

25 other techniques for making site specific mutations. 

In some embodiments of the present invention, nucleotides in the -10, -35 or - 
45 Boxes of a promoter are changed to make those sequence elements look more or 
less like the consensus -10, -35 or -45 Box sequences in the desired Gram-positive 
organisms. Such modification can have the affect of either increasing or decreasing 

30 the activity of the fusion promoter. 
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It will be appreciated that a regulatable fusion promoter suite can be 
constructed which provides a wide dynamic range of both basal and maximal 
transcriptional activities. It will also be appreciated that modifying a regulatable 
fusion promoter so as to increase or decrease its activity can be used to expand the 

5 range of both basal and maximal transcriptional activities of a suite of regulatable 
fusion promoters. Each modified fusion promoter within this suite can be tightly and 
finely regulated over its own characteristic range of transcriptional activity. 
Furthermore, the modified regulatable fusion promoters within the suite can comprise 
different promoters operably linked to the same operators thus providing for a 

10 common means of regulation of the promoters within the suite. In some 
embodiments, the promoter within the set of modified regulatable fusion promoters 
within the suite can be linked to at least one operator contained in a promoterless 
expression vector that is capable of replicating in one or more Gram-positive 
organisms. Accordingly, a regulatable fusion promoter having an appropriate basal 

15 level and/or maximal level of transcriptional activity can be selected from the suite for 
use in the appropriate application. 

In other embodiments, promoters are modified by altering the length of the 
region between the operator or transcription start site and the -10 Box, altering the 
length and/or composition of the region between the -10 Box and the -35 Box, and 

20 altering the length and/or composition of the AT rich region upstream of the -35 Box 
(i.e. the -45 Box). It will also be appreciated that modifications which affect promoter 
activity can include addition of elements such as the CRE region for catabolite 
repression in Gram-positive bacteria. These additional elements can be further 
modified so that they are more or less like their corresponding consensus sequence. 

25 In some embodiments, modified fusion promoters can function in one or more 

Gram-positive organisms. For example, some modified fusion promoters of the 
present invention function in Staphycoccus aueus and Enterococcus faecalis. Other 
modified fusion promoters described herein function in Griam-positive organisms such 
as Bacillus anthracis. Clostridium botulinum, Clostridium difficile, Clostridium 

30 perfiingens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, 
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Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium 
leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, 
Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, 
Streptococcus mutans or any species falling within the genera of any of the above 
5 species. 

In some embodiments of the present invention, the promoters described herein 
are used to initiate transcription of an RNA (sense or anti-sense) in bacteria, desirably 
Gram-positive bacteria. In another embodiment, random polynucleotides (e.g., 
nucleic acids generated by oligonucleotide synthesis, shearing of bacterial genomic 

10 DNA or enzymatic digestion, such as digestion with restriction endonucleases, DNase 
I or other nucleases) are operably linked to a promoter described herein. These 
constructs are transferred to bacteria and can be used to transcribe RNA. 

In one embodiment of the present invention, a fusion promoter comprising an 
operator, such as xylO, and a promoter, such one selected from SEQ ID NOs.: 36 - 

15 45, is operably linked to a nucleic acid to be transcribed from the promoter, the 
construct is introduced into bacteria, a solution of xylose or an analog thereof is 
provided (e.g., a 2.0% xylose solution), and induction of transcription proceeds. 
Preferred nucleic acids that are operably linked to the fusion promoter are at least 6, at 
least 10, at least 15, at least 20, at least 100, at least 200, at least 300, at least 400, at 

20 least 500, at least 800 or more than 800 nucleotides in length and, in some 
embodiments, the nucleic acids that are operably linked to the fusion promoter encode 
a transcript that is complementary to at least 6, at least 10, at least 15, at least 20, at 
least 100, at least 200, at least 300, at least 400, at least 500, at least 800 or more than 
800 nucleotides within the genome of a Gram-positive organism, such as 

25 Staphylococcus or Enterococcus, which code for a gene product such as a protein or 
RNA. That is, nucleic acids that are operably linked to the fusion promoter can be in 
the "sense" or "antisense" orientation and can be at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19, 20, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 
37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 

30 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 
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84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 
250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800 or more than 800 
nucleotides in length. 

In another embodiment, the promoters described herein are used in an 

5 expression construct to initiate the production of a polypeptide or protein in bacteria, 
desirably Gram-positive bacteria. In some aspects of this embodiment, the peptide or 
protein generated corresponds to an expression product encoded by a full-length gene, 
a portion of a full-length gene, or a random polynucleotide or fragment of DNA 
generated by oligonucleotide synthesis or enzymatic digestion of genomic DNA (e.g., 

1 0 bacteria, yeast, or mold) . 

In one embodiment of the present invention, a fusion promoter comprising an 
operator, such as xylO, and a promoter, such one selected from SEQ ID NOs.: 36 - 
45, is operably linked to a nucleic acid to be transcribed from the promoter, the 
construct is introduced into bacteria, a solution of xylose or an analog thereof is 

1 5 provided (e.g., a 2.0% xylose solution), and induction of transcription proceeds. This 
expression method can be performed in Gram-positive or Gram-negative bacteria and 
are desirably performed in Staphylococcus, Enterococcus or other Gram-positive 
species. 

The DNA encoding the desired molecule can be generated by oligonucleotide 
20 synthesis or enzymatic cleavage of DNA (e.g., random fragments of bacterial genomic 
DNA including, but not limited to, Staphylococcus) The peptides expressed by using 
a fusion promoter can be polypeptides, proteins, and fusion proteins. The peptides can 
be, for example, 3-20 amino acids in length, 20-100 amino acids in length, or 100, 
200, 300, 400, 500, 600, 700, 800, or more amino acids in length. That is, the 
25 peptides can be at least 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 20, 
21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 
45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 
68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 
91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 
30 500, 550, 600, 650, 700, 750, 800 or more than 800 amino acids in length. 
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Regulatable promoters providing a range of transcriptional levels in both the 
repressed and induced states, such as the promoters described herein, are desirable in 
applications where production of defined levels of transcript or protein is necessary. 
For example, promoters which can be used to tightly and/or finely regulate 

5 transcription at a desired level are desirable for use in cell-based assays which employ 
sensitized cells for screening potential antibiotic compounds (described in Examples 
19-25). Specifically, it is desirable to use regulatable fusion promoters, such as those 
described herein, to modulate the sensitivity of cells to test compounds by altering the 
amount of a proliferation-inhibiting antisense nucleic acid transcribed by the 

10 promoter. Because the results of such cell-based assays are affected by the degree to 
which the cells are sensitized, the efficacy of these assays can be increased by using 
promoters having the ability to finely regulate the level of proliferation-inhibiting 
transcript over a wide range of transcript concentrations. 

The present invention contemplates fusion promoters which can be used to 

1 5 achieve the desired level of nucleic acid transcription or protein expression desired for 
applications such as the cell-based gene and drug discovery methods described herein. 
The appropriate fusion promoter can be selected by first determining the approximate 
level of transcript production that is required for the assay. For example, if only a low 
level of transcription is necessary, a promoter with a low maximum transcription 

20 efficiency may be used. If a tightly controlable transcription over a wide range is 
required, a fusion promoter having nonleaky repression and a high maximum 
transcription efficiency may used. The amount of actual transcription obtained from 
the fusion promoter can then be fine tuned by determining the concentration of 
inducer which achieves the desired effect. Inducer can be introduced to the ceil 

25 cultures as a solid or a liquid. The inducer can also be an appropriate growth 
condition, including but not limited to, the absense of a regulatory molecule, a change 
in temperature or exposure to radiation. The inducer can comprise from 0% to greater 
than 25% of the concentration of the growth medium or can be an appropriate growth 
condition. In some enbodiments, the inducer comprises 0%, 0.1%, 0.2%, 0.3%, 0.4%, 

30 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, 1.0%, 1.5%, 2.0%, 3.0%, 4.0%, 5.0%, 6.0%, 7.0%, 
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8.0%, 9.0%, 10.0%, 12.0%, 15%, 17%, 20%, 25%, or greater than 25% of the 
concentration of the growth medium. 

The present invention includes vectors containing fusion promoters operably 
linked to a reporter gene. Such expression vectors can be used to quantitate the 
5 transcriptional activity of the fusion promoter contained therein. In some 
embodiments, a promoter can be inserted into the reporter vector as a cassette so as to 
form a fusion promoter construct with an operator located on the vector upstream of 
the reporter gene. In a prefered embodiment, the P-galactosidase encoding genes, 
lacL-lacM, from Leuconostoc mesenteroides, (Israelsen et aL, Appl Environ. 

10 Microbiol 61:2540-2547 (1995); Pedersen et aL, Mol Gen. Genet 244:374-382 
(1994), the disclosures of which are incorporated by reference in their entireties), are 
linked to a fusion promoter described herein. The present invention also contemplates 
methods for using the lacL-lacM genes as a reporter in Enterococcus faecalis. 

Expression vectors containing fusion promoters linked upstream of a multiple 

15 cloning site (MCS) and/or a transcription terminator are also contemplated by the 
present invention. In a preferred embodiment the transcription terminator is a rho- 
independent terminator such as rrnBthl. In some embodiments of the present 
invention, a promoter sequence can be inserted into the expression vector as a cassette 
so as to form a fusion promoter construct with an operator located on the vector 

20 upstream of the MCS. In other embodiments, a promoter linked to an operator can be 
inserted into the expression vector as a cassette just upstream of the MCS. 

Expression vectors containing fusion promoters can be used to produce 
transcripts, such as antisense RNA, or for the expression of proteins. In some 
embodiments, nucleic acids corresponding to proliferation-required genes or 

25 fragments thereof or random genomic fragments are inserted into the MCS in a sense 
or antisense orientation. Upon induction of the fusion promoter with an inducer, 
antisense or sense transcripts are produced. Expression of such transcripts from the 
fusion promoter can be tightly and/or finely regulated. 

In a preferred embodiment of the present invention, reporter and expression 

30 vectors contain both an origin of replication that is functional in Escherichia coli 
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and/or other Gram-negative organisms and an origin of replication that function in one 
or more Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, 
Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium 
diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
5 monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. In some embodiments, an origin of 
replication is functional in Staphylococcus aureus and Enterococcus faecalis. 
10 In some embodiments, nucleic acids that are operably linked to the fusion 

promoter encode a molecule (e.g., a sense RNA, anti-sense RNA, aptamer, or peptide) 
that inhibits proliferation of Gram-positive bacteria including, but not limited to, 
Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium 
perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, 
15 Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium 
leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, 
Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, 
Streptococcus mutans or any species falling within the genera of any of the above 
species. Certain embodiments include methods of identifying genes that are essential 
20 to bacterial proliferation ("essential genes") whereby the constructs described above 
are selectively induced in bacteria, such as pathogenic bacteria, at various levels and 
the impact on cell proliferation is monitored. For example, the fusion promoters of 
the present invention may be utilized in any of the methods described in United States 
Patent Application Serial Number 09/492,709, United States Patent Application Serial 
25 Number 09/71 1,164, United States Patent Application Serial Number 09/741,699, and 
United States Patent Application Serial Number 09/815,242, the disclosures of which 
are incorporated by reference herein in their entireties. 

One such method is practiced by introducing a candidate nucleic acid into a 
microorganism, wherein the candidate nucleic acid is operably linked to a fusion 
30 promoter comprising a xyl, tet, trp, mal, lac or Ac] operator and one of the promoter 
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sequences described herein. The fusion promoter may comprise a promoter that has 
been modified to to alter its transcriptional activity in one or more Gram-positive 
bacteria including but not limited to Bacillus anthracis, Clostridium botulinum, 
Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium 

5 diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. In some embodiments, the candidate 

10 nucleic acid is generated by random oligonucleotide synthesis and in other 
embodiments, the candidate nucleic acid is generated by cleavage of genomic DNA 
from an organism (e.g., bacteria, viruses, fungi, mold, parasites, yeast, plants and 
animals). For example, the candidate nucleic acid may be a random fragment 
generated by methods such as DNasel digestion, restriction endonuclease digestion or 

15 shearing. In some cases, the candidate nucleic acid will be made so that an antisense 
RNA, sense RNA, aptamer, peptide, or protein will be expressed upon induction of 
the fusion promoter. In some cases, the candidate nucleic acid will encode a transcript 
that corresponds to or is complementary to a gene present in the genome of an 
organism (e.g., bacteria, viruses, fungi, mold, parasites, yeast, plants and animals). In 

20 other cases, the candidate nucleic acid will encode a peptide that corresponds to or 
interacts with a protein in the bacteria. 

In the embodiments in which an aptamer is transcribed upon induction of the 
fusion promoter, the aptamer may be a nucleic acid which directly binds to the target 
molecules. Alternatively, the aptamer may encode a peptide that binds to the target 

25 molecule. For example, the aptamer may encode a peptide of at least 4, 6, 8, 10, 15, 
20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70 or more than 70 amino acids in length. 
Aptamers which encode peptides have been described in PCT publication WO 
99/50462, the disclosure of which is incorporated herein by reference in its entirety. 

For example, RNA aptamers corresponding to regions of Tat and Rev RNA 

30 were introduced into or expressed in a cell and were shown to be specific RNA 
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ligands for proteins that can specifically inhibit their translation. (Good et at, 1997, 
Gene Therapy 4: 45-54). Similarly, Conrad et al (J. Biol. Chem. 269: 32051, 1994) 
have shown that certain RNA aptamers can inhibit the P isoenzyme of PKC. These 
RNA aptamers were selected from a pool of RNA molecules that contained a 120- 
5 nucleotide randomized region. 

In general, the aspects of the invention that employ aptamer technology 
concern the use of the fusion promoters described herein to drive expression of 
aptamers of varying lengths and compositions. A minimum of approximately 6 
nucleotides, preferably 10, and more preferably 14 or 15 nucleotides, may be 
10 necessary to effect specific binding to some target molecules. Thus, in some 
embodiments, fusion promoters of the invention are linked to oligonucleotides 
encoding aptamers that are 6, 10, 14, or 15 nucleotides in length. The only apparent 
limitations on the binding specificity of the target/aptamer complex concern sufficient 
sequence to be distinctive in the binding aptamer and sufficient binding capacity of the 
15 target molecule to obtain the necessary interaction. Aptamers of binding regions 
containing sequences shorter than 10, e.g., 6-mers, are feasible if the appropriate 
interaction can be obtained in the context of the environment in which the target is 
placed. Thus, if there are few interferences by other materials, less specificity and less 
strength of binding may be required. 
20 Several methods for selecting aptamer oligonucleotides directed against a 

specific protein are known in the art and these approaches can be employed to design 
constructs having the fusion promoters linked to an oligonucleotide that encodes an 
aptamer. Examples of these methods include those methods described by Mills et al 
Proc. Natl. Acad. Sci. USA 58:224 (1967), Green et al Nature 347:406 (1990), and 
25 Tuerk and Gold Science 249:505 (1990), (ail references are expressly incorporated by 
reference in their entireties), which show the use of in vitro evolution techniques to 
select RNA molecules with desired properties. Ellington and Szostak Nature 346:818 
(1990), and Bock et al Nature 355:564 (1992) (all references are expressly 
incorporated by reference in their entireties) also describe techniques useful for the 
30 selection of DNA oligonucleotides. More teachings on the design and selection of 
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apatamers can be found in U.S. Pat. Nos. 5,756,291; 5,874,567; 5,874,281; 5,998,596; 
and 5,965,352 (all references are expressly incorporated by reference in their 
entireties). The field has also been recently reviewed by Burke and Berzal-Herranz 
FASEB J 7:106 (1993), herein expressly incorporated by reference in its entirety. 

5 Once the construct comprising the candidate nucleic acid operably linked to a 

promoter of the invention is made, it is transferred to host bacteria. Next, the 
proliferation of microorganisms transcribing a higher level of the candidate nucleic 
acid is compared to the proliferation of microorganisms that transcribe a lower level of 
the candidate nucleic acid or that do not transcribe the candidate nucleic acid. These 

10 varying levels of transcription can be achieved by varying the amounts of inducer 
(e.g., xylose, tetracycline, maltose, the absense of tryptophan, temperature, exposure to 
ultraviolet radiation, or analogs thereof). The level of maximal induction that can be 
achieved will depend on the fusion promoter selected. A difference in cell 
proliferation in the sample transcribing a higher level of the candidate nucleic acid 

15 compared to the sample expressing a lower level of the candidate nucleic acid 
indicates that the candidate nucleic acid encodes a nucleic acid that is complementary 
to an essential gene or encodes a peptide or protein that reduces proliferation of the 
microorganism. 

In some embodiments, the RNA, aptamer, peptide or protein reduces 
20 proliferation of the microorganisms by interacting with another molecule required for 
proliferation. In this case, many techniques in molecular biology can be employed to 
identify the cell proliferation molecule that interacts with the aptamer, peptide, or 
protein and the gene encoding this molecule. Conventional one and two hybrid 
systems, for example, can be readily adapted to identify molecules that bind to an 
25 aptamer, peptide, or protein described above, for example. Such approaches include: 

(1) the two-hybrid systems (Field & Song, Nature 340:245-246 (1989); 
Chien et al y Proc. Natl. Acad ScL USA 88:9578-9582 (1991); and 
Young KH, Biol. Reprod. 58:302-311 (1998), all of which are 
30 expressly incorporated by reference in their entirety); 
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(2) the reverse two-hybrid system (Leanna & Hannink, Nucl Acid Res. 
24:3341-3347 (1996), herein incorporated by reference in their 

entirety); 

(3) the repressed transactivator system (Sadowski et at, U.S. Pat. No. 
5 5,885,779), herein incorporated by reference in their entirety); 

(4) phage display systems (Lowman HB, Annu. Rev. Biophys. Biomol 
Struct. 26:401-424 (1997), herein incorporated by reference in their 
entirety); and 

(5) GST/HIS pull down assays, mutant operators (Granger et a/., WO 
10 98/01879) and the like {See also Mathis G., Clin. Chem. 41:139-147 

(1995); Lam K.S. Anticancer Drug Res., 12:145-t67 (1997); and 
Phizicky et at., Microbiol. Rev. 59:94-123 (1995), all of which are 
expressly incorporated by reference in their entirety). 

15 In one embodiment, the present invention utilizes an antisense-based method to 

identify proliferation-required sequences. Generally, a library of nucleic acids from a 
given source are subcloned or otherwise inserted immediately downstream of an 
inducible fusion promoter on an appropriate vector, such as one of the expression 
vectors described herein, thus forming an expression library. It is generally preferred 

20 that expression is directed by a regulatable fusion promoter such that expression level 
can be adjusted by addition of variable concentrations of an inducer molecule or of an 
inhibitor molecule to the medium. Temperature activated fusion promoters, such as 
promoters regulated by temperature sensitive repressors, such as the lambda Ci&57 
repressor, are also envisioned. Although the insert nucleic acids may be derived from 

25 the chromosome of the cell or microorganism into which the expression vector is to be 
introduced, because the insert is not in its natural chromosomal location, the insert 
nucleic acid is an exogenous nucleic acid for the purposes of the discussion herein. 

Once generated, the expression library containing the nucleic acids is introduced 
into a population of cells (such as the organism from which the nucleic acid sequences 

30 were obtained or an organism other than the organism from which the sequences were 
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obtained) to search for genes that are required for bacterial proliferation. Because the 
library molecules are foreign, in context, to the population of cells, the expression 
vectors and the nucleic acid segments contained therein are considered exogenous 
nucleic acid. 

5 Expression of the nucleic acid fragments in the test population of cells 

containing the expression library is then activated Activation of the expression vectors 
consists of subjecting the cells containing the vectors to conditions that result in the 
expression of the nucleic acids carried by the expression library. The test population of 
cells is then assayed to determine the effect of expressing the nucleic acid fragments on 

1 0 the test population of cells. Those expression vectors that negatively impact the growth 
of the cells upon induction of expression of the random sequences contained therein are 
identified, isolated, and purified for further study. 

A variety of assays are contemplated to identify nucleic acid sequences that 
negatively impact growth upon expression. In one embodiment, growth in cultures 

15 expressing nucleic acid sequences and growth in cultures not expressing these sequences 
is compared. Growth measurements are assayed by examining the extent of growth by 
measuring optical densities. Alternatively, enzymatic assays can be used to measure 
bacterial growth rates to identify nucleic acid sequences of interest. Colony size, 
colony morphology, and cell morphology are additional factors used to evaluate growth 

20 of the host cells. Those cultures that fail to grow or grow at a reduced rate under 
expression conditions are identified as containing an expression vector encoding a 
nucleic acid fragment that negatively affects a proliferation-required gene. 

In some embodiments, genomic fragments of an organism are operably linked to 
an inducible fusion promoter in a vector and assayed for growth inhibition activity. 

25 Examples below describe the examination of libraries of genomic fragments cloned into 
vectors comprising inducible fusion promoters. Vectors containing fusion promoters, 
such as those described herein, can be used. Upon transformation of these vectors into a 
host organism and induction, the vectors produce an RNA molecule corresponding to 
the subcloned genomic fragments. In those instances where a genomic fragment is in ah 

30 antisense orientation with respect to the fusion promoter, the antisense transcript 
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produced can be complementary to at least a portion of a sense messenger RNA 
(mRNA) encoding a proliferation-required gene product such that the antisense 
transcript interacts with at least a portion of the proliferation-required sense mRNA 
produced by the host organism thereby decreasing the translation efficiency or the level 

5 of the proliferation-required sense mRNA and thus decreasing production of the 
proliferation-required protein encoded by the sense mRNA molecule. In cases where the 
sense mRNA encodes a protein required for proliferation, the host organism containing a 
vector from which transcription from the fusion promoter has been induced fails to grow 
or grows at a substantially reduced rate. Additionally, in cases where the transcript 

1 0 produced is complementary to at least a portion of a non-translated RNA and where that 
non-translated RNA is required for proliferation, host cells containing a vector from 
which transcription from the fusion promoter has been induced also fail to grow or grow 
at a substantially reduced rate. 

In cases where the candidate nucleic acid encodes an antisense nucleic acid 

15 comprising a nucleotide sequence that is complementary to a gene required for 
proliferation or a portion thereof, the gene sequence of the coding fragment of the 
proliferation-required gene or a portion thereof can be determined by direct 
sequencing. 

Determination of the gene(s) corresponding to the nucleotide sequence can 
20 achieved by comparing the obtained sequence data with databases containing known 
protein and nucleotide sequences from various microorganisms. Thus, initial gene 
identification can made on the basis of significant sequence similarity or identity to 
either characterized or predicted from Staphylococcus aureus and Enter ococcus faecalis 
genes or their encoded proteins and/or homologues in other species. 
25 The number of nucleotide and protein sequences available in database systems 

has been growing exponentially for years. For example, the complete nucleotide 
sequences of Caenorhabditis elegans and several bacterial genomes, including B. coli, 
Aeropyrum pernix, Aquifex aeolicus, Archaeoglobus fulgidus, Bacillus subtilis, 
Borrelia burgdorferi, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium 
30 tetani, Corynebacterium diptheria, Deinococcus radiodurans, Haemophilus 



-52- 



WO 02/051982 



PCT/US01/50250 



influenzae, Helicobacter pylon 26695, Helicobacter pylori J99, Methanobacterium 
thermoautotrophicum, Methanococcus jannaschii, Mycobacterium tuberculosis, 
Mycoplasma genitalium. Mycoplasma pneumoniae, Pseudomonas aeruginosa, 
Pyrococcus abyssi, Pyrococcus horikoshii, Rickettsia prowazekii, Synechocystis 

5 PCC6803, Thermotoga maritima, Treponema pallidum, Bordetetta pertussis, 
Campylobacter jejuni, Clostridium acetobutylicum, Mycobacterium tuberculosis 
CSUU93, Neisseria gonorrhoeae, Neisseria meningitidis, Pseudomonas aeruginosa, 
Pyrobaculum aerophilum, Pyrococcus furiosus, Rhodobacter capsulatus, Salmonella 
typhimurium, Streptococcus mutans, Streptococcus pyogenes, Ureaplasma 

10 urealyticum and Vibrio cholera are available. This nucleotide sequence information is 
stored in a number of databanks, such as GenBank, the National Center for 
Biotechnology Information (NCBI), the Genome Sequencing Center 
(http://genome.v^tLedu/gsc/salmonell£ushtml), and the Sanger Centre 
(http://ww.sanger.ac.uk/projects/S__typhi) which are publicly available for searching. 

15 A variety of computer programs are available to assist in the analysis of the 

sequences stored within these databases. FASTA, (W. R. Pearson (1990) "Rapid and 
Sensitive Sequence Comparison with FASTP and FASTA" Methods in Enzymology 
183:63- 98), Sequence Retrieval System (SRS), (Etzold & Argos, SRS an indexing 
and retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9:49-57, 1993) are 

20 two examples of computer programs that can be used to analyze sequences of interest. 
In one embodiment of the present invention, the BLAST family of computer 
programs, which includes BLASTN version 2.0 with the default parameters, or 
BLASTX version 2.0 with the default parameters, is used to analyze nucleotide 
sequences. 

25 BLAST, an acronym for "Basic Local Alignment Search Tool " is a family of 

programs for database similarity searching. The BLAST family of programs includes: 
BLASTN, a nucleotide sequence database searching program, BLASTX, a protein 
database searching program where the input is a nucleic acid sequence; and BLASTP, a 
protein database searching program. BLAST programs embody a fast algorithm for 

30 sequence matching, rigorous statistical methods forjudging the significance of matches, 
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and various options for tailoring the program for special situations. Assistance in using 
the program can be obtained by e-mail at blast@ncbijilm.nih.gov. tBLASTX can be 
used to translate a nucleotide sequence in all three potential reading frames into an 
amino acid sequence. 

5 Bacterial genes are often transcribed in polycistronic groups. These groups 

comprise operons, which are a collection of genes and intergenic sequences under 
common regulation. The genes of an operon are transcribed on the same mRNA and are 
often related functionally. Given the nature of the screening protocol, it is possible that 
the identified exogenous nucleic acid corresponds to a gene or portion thereof with or 

10 without adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a 
gene), an intergenic sequence (i.e. a sequence between genes), a nucleotide sequence 
spanning at least a portion of two or more genes, a 5* noncoding region or a 3' 
noncoding region located upstream or downstream from the actual nucleotide sequence 
that is required for bacterial proliferation. Accordingly, it is often desirable to determine 

15 which gene(s) that is encoded within the operon is individually required for 
proliferation 

In one embodiment of the present invention, an operon is identified and then 
dissected to determine which gene or genes are required for proliferation. Operons 
can be identified by a variety of means known to those in the art. For example, the 

20 RegulonDB DataBase described by Huerta et al. (Nucl. Acids Res. 26:55-59, 1998), 
which may also be found on the website 
http://www.cifti.unam.mx/Computational_Biology/regulondb/ , the disclosures of 
which are incorporated herein by reference in their entireties, provides information 
about operons in Escherichia coli. The Subtilist database 

25 (http://bioweb.pasteur.fr/GenoList/SubtiList), ( Moszer, L, Glaser, P. and Danchin, A. 
(1995) Microbiology 141: 261-268 and Moszer, I (1998) FEBS Letters 430: 28-36, the 
disclosures of which are incorporated herein in their entireties), may also be used to 
predict operons. This database lists genes from the fully sequenced, Gram-positive 
bacteria, Bacillus subtilis, together with predicted promoters and terminator sites. 

30 This information can be used in conjunction with the Staphylococcus aureus genomic 
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sequence data to predict operons and thus produce a list of the genes affected by the 
antisense nucleic acids of the present invention. The TIGR microbial database has an 
incomplete version of the Enterococcus faecalis genome http://www.tigr.org/cgi- 
bin/BlastSearc!^last.cgi?organism=e_faecalis. One can take a nucleotide sequence 
5 and BLAST it for homologs. 

A number of techniques that are well known in the art can be used to dissect the 
operon. Analysis of RNA transcripts by Northern blot or primer extension techniques 
are commonly used to analyze operon transcripts. In one aspect of this embodiment, 
gene disruption by homologous recombination is used to individually inactivate the 

1 0 genes of an operon that is thought to contain a gene required for proliferation. 

Several gene disruption techniques have been described for the replacement of 
a functional gene with a mutated, non-functional (null) allele. These techniques 
generally involve the use of homologous recombination. One technique using 
homologous recombination in Staphylococcus aureus is described in Xia et a.. 1999, 

15 Plasmid 42: 144-149, the disclosure of which is incorporated herein by reference in its 
entirety. This technique uses crossover PCR to create a null allele with an in-frame 
deletion of the coding region of a target gene. The null allele is constructed in such a 
way that nucleotide sequences adjacent to the wild type gene are retained. These 
homologous sequences surrounding the deletion null allele provide targets for 

20 homologous recombination so that the wild type gene on the Staphylococcus aureus 
chromosome can be replaced by the constructed null allele. Enterococcus faecalis 
genes can be disrupted by recombining in a non-replicating plasmid that contains an 
internal fragment to that gene (Leboeuf, G, L. Leblanc, Y. Auffray and A. Hartke. 
2000. J. Bacterid. 182:5799-5806, the disclosure of which is incorporated herein by 

25 reference in its entirety). 

The crossover PCR amplification product is subcloned into a suitable vector 
having a selectable marker, such as a drug resistance marker. In some embodiments 
the vector may have an origin of replication which is functional in Escherichia coli or 
another organism distinct from the organism in which homologous recombination is to 

30 occur, allowing the plasmid to be grown in Escherichia coli or the organism other than 
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that in which homologous recombination is to occur, but may lack an origin of 
replication functional in Staphylococcus aureus or Enterococcus faecalis such that 
selection of the selectable marker requires integration of the vector into the 
homologous region of the Staphylococcus aureus or Enterococcus faecalis 

5 chromosome. Usually a single crossover event is responsible for this integration event 
such that the Staphylococcus aureus or Enterococcus faecalis chromosome now 
contains a tandem duplication of the target gene consisting of one wild type allele and 
one deletion null allele separated by vector sequence. Subsequent resolution of the 
duplication results in both removal of the vector sequence and either restoration of the 

10 wild type gene or replacement by the in-frame deletion. The latter outcome will not 
occur if the gene should prove essential. A more detailed description of this method is 
provided below. It will be appreciated that this method may be practiced with any 
genes identified as required for proliferation in any of the organisms described herein. 
Bacteria that harbor the expression vectors described herein can be induced to 

15 transcribe a desired level of RNA or to express a desired level of peptide, which 
affects a gene that is essential for the viability and/or growth of the cell and renders 
the cells more sensitive to compounds that reduce cellular proliferation. This 
technique is referred to as Essential Gene Interference (EGI). In one embodiment, for 
example, an RNA, such as an antisense RNA, or peptide generated from the constructs 

20 described above interacts with an essential gene of a bacteria, RNA, or product thereof 
and when the level of transcript or peptide is increased in the cell, viability and/or cell 
growth is compromised. Thus, the genes that are affected by the transcript or peptide 
either directly or indirectly (e.g., by interacting with the RNA or protein that 
correspond to the gene) causing a reduction in viability and/or cell growth are 

25 identified as essential genes. These essential genes are then used as drug discovery 
targets for high throughput drug screening. 

The promoters described herein can also be used to replace endogenous 
promoters so as to create cells having specific regulatable genes. By this approach, an 
essential gene is identified (e.g., using the techniques described herein) and the 

30 endogenous promoter is replaced with a regulatable promoter of the invention (e.g, 
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Xyl-T5, Xyl-T5-DD, Xyl-CP25, Xyl-P32, Xyl-P59, Xyl-P59» Xyl-PlP2, Xyi-PlP2\ 
Xyl, PL, and XylPLlO) using ET cloning technology. This technique is referred to as 
Rapid Replacement of Genomic Promoters (promoter replacement). This method 
employs the Polymerase Chain Reaction (PCR) to replace the promoter of the target 
5 gene without interrupting the endogenous Shine-Dalgarno sequence. As little as 60bp 
homology to the target promoter can be used. The correct integration of the cassette is 
confirmed by colony PCR and correct integrants are tested for growth defects when an 
inducer is titrated away. The inability to grow in the absence of inducer confirms the 
essentiality of the target gene. Additionally, the bacterial strains created using this 

10 approach can be used in high throughput drug screening methodologies. 

A similar technique to promoter replacement can be used to insert an operator 
into a genomic promoter region so as to make the genomic promoter regulatable. This 
technique is referred to as operator insertion. As little as 60bp homology to the target 
promoter region can be used. The correct integration of the cassette is confirmed by 

15 colony PCR and correct integrants are tested for growth defects when an inducer is 
titrated away. The inability to grow in the absence of inducer confirms the essentiality 
of the target gene. Additionally, the bacterial strains created using this approach can 
be used in high throughput drug screening methodologies. 

In some embodiments, promoter replacement and operator insertion methods 

20 are conducted in cells which have an increased frequency of homologous 
recombination. For example, the organism may lack or have a reduced level or 
activity of one or more exonucleases which would ordinarily degrade the DNA to be 
inserted into the chromosome, thereby effectively increasing the stability of the linear 
DNA in the organism. 

25 In still further embodiments, the strains may both lack or have reduced levels 

of exonucleases and constitutively or conditionally express proteins involved in 
mediating homologous recombination. For example, Gram-positive bacterial strains 
in which the activity of an exonuclease which degrades linear nucleic acids, such as 
exonuclease V of the RecBCD recombination pathway or an exonuclease with an 

30 analogous function, has been reduced or eliminated, may be used. Such strains 
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include those having mutations in the recB, recC, or recD genes or genes that express 
proteins having analogous functions, that enhance the frequency of homologous 
recombination. In some embodiments, the strains have mutations in more than one of 
the recB, recC, or recD genes or genes that express proteins having analogous 

5 functions, that enhance the frequency of homologous recombination. For example the 
strains may have mutations in both the recB and recC genes or genes that express 
proteins having analogous functions. 

It will be appreciated that Gram-positive organisms having reduced or 
eliminated levels of one or more proteins analogous in function to RecB, RecC, or 

10 RecD may be used with the methods described herein. In some embodiments, Gram- 
positive organisms having reduced or eliminated levels of one or more proteins 
analogous in function to RecB, RecC or RecD may be selected from the group 
consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, 
Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, 

15 Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. In some embodiments, it may be 

20 desirable to use Escherichia coli having reduced or eliminated levels of one or more 
of RecB, RecC or RecD proteins as a surrogate host in the methods described herein. 

The methods may also be performed in Gram-positive bacterial strains in 
which the activity of a the RecET recombinase system of the Rac prophage, or a 
recombinase system having an analogous function, has been activated. Such strains 

25 are analogous to Escherichia coli strains which carry an sbcA mutation. The RecE 
gene of the Rac prophage encodes ExoVIII a 5'-3' exonuclease, while the RecT gene 
of the Rac prophage encodes a single stranded DNA binding protein which facilitates 
renaturation and D-loop formation. Thus, the gene products of the RecE and RecT 
genes or proteins with analogous functions facilitate homologous recombination. The 

30 RecE and RecT genes are on the same operon and are not normally expressed. 
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Similarly, genes having a function analogous to the RecE and RecT may not be 
expressed However, strains having a mutation analogous to sbcA would activate the 
expression of RecE and RecT or genes having analogous function. In some 
embodiments, the methods may be performed in strains which cany mutations in the 
5 recB and recC genes or genes that express proteins having analogous functions, as 
well as the mutation in the gene analogous in function to sbcA. 

In some embodiments, the methods may be performed in Gram-positive 
bacterial strains in which recombination via the RecF pathway or a pathway having an 
analogous function has been enhanced. One example of such strains are those having 

10 a mutation analogous in function to the Escherichia coli sbcB mutation. 

It will be appreciated that the recE and recT gene products, or proteins with 
analagous functions may be constitutively or conditionally expressed in Gram-positive 
organisms. In some embodiments, these proteins may be conditionally or 
constitutively expressed in Gram-positive organisms such as Bacillus anthracis, 

15 Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 

20 mutans or any species falling within the genera of any of the above species. In some 
embodiments, Gram-positive strains may be genetically engineered to express the 
RecE and recT genes of the Rac prophage. For example, plasmids encoding these 
gene products may be introduced into the organism. If desired, the coding sequences 
encoding these gene products may be optimized to reflect the codon preferences of the 

25 organism in which they are to be expressed. In some embodiments, it may be 
desirable to use Escherichia coli constitutively or conditionally expressing the RecE 
and RecT gene products as a surrogate host in the methods described herein. 

In further embodiments, the methods may be conducted in cells which utilize 
the Red system of bacteriophage lambda (k) or analogous systems from other phages 

30 to enhance the frequency of homologous recombination. The Red system contains 
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three genes, y, p and exo whose products are the Gam, Bet and Exo proteins (see Ellis 
et al. PNAS 98:6742-6746, 2001, the disclosure of which is incorporated herein by 
reference in its entirety). The Gam protein inhibits the RecBCD exonuclease V, thus 
permitting Bet and Exo to gain access to the ends of the DNA to be integrated and 

5 facilitating homologous recombination. The Bet protein is a single stranded DNA 
binding protein. The Exo protein is a double-stranded DNA dependent 5 '-3' 
exonuclease. Thus, expression of the X red proteins or proteins having analogous 
functions facilitates homologous recombination. 

It will be appreciated that the X Bet, Gam and Exo proteins, or proteins with 

10 analagous functions may be constitutively or conditionally expressed in Gram-positive 
organisms. In some embodiments, these proteins may be conditionally or 
constitutively expressed in Gram-positive organisms such as Bacillus anthracis, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 

15 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans or any species falling within the genera of any of the above species. For 
example, plasmids encoding these gene products may be introduced into the organism. 

20 If desired, the coding sequences encoding these gene products may be optimized to 
reflect the codon preferences of the organism in which they are to be expressed. In 
some embodiments, it may be desirable to use Escherichia coli constitutively or 
conditionally expressing the X Bet, Gam and Exo gene products as a surrogate host in 
the methods described herein. 

25 In some embodiments, the cells may have an increased frequency of 

homologous recombination as a result of more than one of the aforementioned 
characteristics. In some embodiments, the enhanced frequency of recombination may 
be a conditional characteristic of the cells which depends on the culture conditions in 
which the cells are grown. For example, in some embodiments, expression of the X 

30 Red Gam, Exo, and Bet proteins or recE and recT proteins may be constitutive or 
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regulated Thus, the cells may have an increased frequency of homologous 
recombination as a result of any combination of the aforementioned characteristics. 
For example, in some embodiments, the cell may carry the sbcA and recBC mutations 
or analogous mutations. 

5 In some embodiments, a linear double stranded DNA containing the 

regulatable fusion promoter, operator or other regulatory element to be inserted into 
the chromosome of the prokaryotic organism is introduced into an organism 
constitutively or conditionally expressing the recE and recT or the k Bet, Gam and 
Exo proteins or proteins with analogous functions as described above. In some 

10 embodiments, the double stranded DNA may be introduced into an organism having 
the recBC and sbcA mutations or analogous mutations. 

In other embodiments, a single stranded DNA containing the regulatable 
fusion promoter, operator or other regulatory element to be inserted into the 
chromosome of the prokaryotic organism is introduced into an organism constitutively 

15 or conditionally expressing the k Bet protein or a protein with an analogous function. 
In some embodiments the single stranded DNA is introduced into an organism 
constitutively or conditionally expressing both the k Bet and Gam proteins or proteins 
with analogous functions. In further embodiments, the single stranded DNA is 
introduced into an organism constitutively or conditionally expressing the k Bet, Gam 

20 and Exo proteins or proteins with analogous functions. 

It will be appreciated that any method of enhancing the frequency of 
homologous recombination may be used in conjunction with the present invention and 
that the methods of the present invention are not limited to use in cells having the 
aforementioned characteristics. 

25 The 5' and 3' regions of the linear nucleic acid to be integrated via 

homologous recombination may have any length of homology to the target sequence 
in the chromosome of the organism which permits homologous recombination to 
occur. In some embodiments, the nucleic acid to be integrated via homologous 
recombination is generated by PGR. In such embodiments, it is desirable for the PCR 

30 amplicon to be relatively short Accordingly, in some embodiments, the 5' and 3* 
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regions of the amplicon which have homology to the target sequence in the 
chromosome of the organism, may each be more than 120 nucleotides in length, at 
least 120 nucleotides in length, at least 100 nucleotides in length, at least 90 
nucleotides in length, at least 80 nucleotides in length, at least 70 nucleotides in 
5 length, at least 60 nucleotides in length, at least 50 nucleotides in length, at least 40 
nucleotides in length, at least 30 nucleotides in length or less than 30 nucleotides in 
length. 

As discussed above, the linear double stranded DNA or linear single stranded 
DNA may be introduced into a suitable strain having an enhanced frequency of 

10 recombination. 

In one embodiment of the present invention, genes required for proliferation 
are identified by methods in which the native promoter of a gene is replaced with a 
regulatable fusion promoter using homologous recombination. In such methods, a 
linear promoter replacement construct which comprises a 5' portion homologous to a 

1 5 first portion of the native promoter, a 3' portion homologous to a second portion of the 
native promoter and a regulatable fusion promoter disposed between the 5' portion 
and the 3' portion is introduced into a cell containing the target gene. After 
replacement of the native promoter in the target gene with the regulatable fusion 
promoter via homologous recombination, the extent of proliferation of cells in which 

20 the regulatable fusion promoter has replaced the native promoter under conditions in 
which the regulatable fusion promoter is active at a first level is compared to the 
extent of proliferation of the cells under conditions in which the regulatable fusion 
promoter is active at a second level which is lower than the first level. If the candidate 
gene is required for proliferation, the extent of proliferation of the cells grown under 

25 conditions where the fusion promoter is active at the higher level will be greater than 
the extent of proliferation of ceils grown under conditions where the fusion promoter 
is active at the lower level. 

The regulatable fusion promoter in the promoter replacement construct may be 
any of the regulatable fusion promoters described herein. For example, in some 

30 embodiments, the regulatable fusion promoter may be selected from the group 
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consisting of Xyl-T5, Xyl-T5-DD, Xyl-CP25, Xyl-P32, Xyi-P59, Xyl-P59*, Xyl- 
P1P2, Xyl-PlP2*, Xyl-PL, and Xyl-PLIO. 

In some embodiments, the linear promoter replacement construct may further 
comprise a fusion promoter operably linked to a nucleic acid which encodes an 

5 identifiable marker or a selectable marker. The promoter which is operably linked to a 
nucleic acid encoding an identifiable marker or a selectable marker and the nucleic 
acid which encodes the identifiable marker or selectable marker may be 5* of the 
regulatable fusion promoter. The identifiable marker may be any protein whose 
expression is readily detectable. For example, the identifiable marker may be P- 

10 galactosidase, or any of the markers familiar to those skilled in the art which allow 
identification of cells expressing the marker. Likewise, the selectable marker may be 
any protein which allows selection of cells which express the protein. For example, 
the selectable marker may confer resistance to an antibiotic or permit growth under a 
particular set of culture conditions. A variety of selectable markers are familiar to 

15 those skilled in the art, including genes which confer resistance to antibiotics such as 
chloramphenicol, ampicillin, kanamycin and tetracycline. 

In some embodiments, a transcriptional terminator, such as the rrnB terminator 
may be positioned between the 3' end of the nucleic acid encoding the identifiable 
marker or selectable marker and the 5' end of the regulatable fusion promoter. 

20 The promoter replacement method may be performed in any desired organism. 

For example, in some embodiments, the method may be practiced in Gram-positive 
bacteria. In certain embodiments, the ceil to be used with any of the methods 
described herein is selected from the group consisting of Bacillus anthracis, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 

25 teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans or any species falling within the genera of any of the above species. 
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In some embodiments, the promoter replacement method is conducted in cells 
which have an increased frequency of homologous recombination. For example, in 
some embodiments, the method is conducted in cells which cany the sbcA and recBC 
mutations, or in cells having mutations which confer similar phenotypes. In other 

5 embodiments, the cells may express one or more proteins in the A. Red recombination 
system. For example, in some embodiments the cells may constitutively or 
conditionally express the k Gam, Bet, and Exo proteins or other proteins having 
similar functions. In other embodiments, the cells may constitutively or conditionally 
express the X Bet protein or a protein having a similar function. 

10 In some embodiments, a double stranded promoter replacement construct is 

introduced into a cell comprising the recBCD and sbcA mutations, recBC and sbcA 
mutations or mutations with analogous consequences. The double stranded promoter 
replacement construct may or may not comprise a gene encoding a selectable or 
identifiable marker, but in preferred embodiments the double stranded promoter 

15 replacement construct includes a gene enocding a selectable or identifiable marker. In 
other embodiments, a double stranded promoter replacement construct is introduced 
into a cell constitutively or conditionally expressing the recE and red proteins or 
proteins with analogous functions. In still other embodiments, a double stranded 
promoter replacement construct is introduced into a cell constitutively or conditionally 

20 expressing the X Gam, Bet, and Exo proteins or other proteins having similar 
functions. In such embodiments, cells in which the promoter replacement construct 
has integrated are identified by obtaining colonies which express the identifiable 
marker or by growing the cells in appropriate selective media. If desired, homologous 
integration may be confirmed by performing a Southern blot or amplification reaction 

25 using a probe or primers which will yield a known result if homologous recombination 
has occurred. 

In other embodiments, a promoter replacement construct may be introduced 
into the cell as a single stranded DNA. Preferably, the single stranded promoter 
replacement construct does not comprise a gene which encodes an identifiable or 
30 selectable marker, but in some embodiments a gene which encodes an identifiable or 
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selectable marker may be included in the construct. In some embodiments, the single 
stranded nucleic acid may be generated by in vitro synthesis. In other embodiments, a 
circular single stranded DNA, such as Ml 3, comprising the promoter replacement 
construct may be fragmented by sonication and the fragments may be introduced into 

5 the cell. In still other embodiments, a single stranded promoter replacement construct 
is introduced into a cell constitutively or conditionally expressing the X Bet protein, 
the Bet and Gam proteins, the Bet, Gam and Exo proteins or other proteins having 
similar functions. Cells in which integration has occurred at the desired location are 
identified by performing colony hybridization, amplification reactions, or other 

10 methods familiar to those skilled in the art. 

As discussed above, the 5* and 3* portions of the linear promoter replacement 
construct which have homology to the promoter region of the target gene may have 
any length which permits homologous recombination to occur. Accordingly, in some 
embodiments, the 5' and 3' regions of the amplicon which have homology to the 

15 promoter region of the target gene in the chromosome of the organism, may each be 
more than 120 nucleotides in length, at least 120 nucleotides in length, at least 100 
nucleotides in length, at least 90 nucleotides in length, at least 80 nucleotides in 
length, at least 70 nucleotides in length, at least 60 nucleotides in length, at least 50 
nucleotides in length, at least 40 nucleotides in length, at least 30 nucleotides in length 

20 or less than 30 nucleotides in length. 

In some embodiments, the promoter replacement construct is introduced into 
the cell as a linear nucleic acid. In particular, in some embodiments, the promoter 
replacement construct is generated by an conducting amplification reaction, such as a 
PCR reaction, to generate a linear amplicon. In some aspects of this embodiment, the 

25 PCR reaction is performed on a linearized plasmid. In a preferred aspect of this 
embodiment, the amplicon contains about 60 bp of homology to the target gene at 
both the 5' and 3' ends of the construct 

Thus, one embodiment of the invention relates to a method for identifying a . 
gene which is required for proliferation of a cell comprising replacing the native 

30 promoter of a gene with a regulatable fusion promoter and identifying cells in which 
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the extent of proliferation of the cell when the fusion promoter is active at a first level 
is substantially different than the extent of proliferation of the cell when the fusion 
promoter is active at a second level, the first level being greater than the second level. 
In some embodiments, the cell has an enhanced frequency of homologous 

5 recombination. In some embodiments, the fusion promoter is an inducible promoter. 
In other embodiments, the level of transcription from the promoter varies with the 
level of inducer present in the medium in which the cell is grown. In certain 
embodiments, the step of replacing the native promoter comprises introducing a linear 
nucleic acid comprising a 5' portion homologous to a first portion of the native 

10 promoter, a 3' portion homologous to a second portion of the native promoter and the 
regulatable promoter disposed between the 5* portion and the 3' portion into a cell 
such that homologous recombination occurs between the 5* portion and the first 
portion of the native promoter and between the 3' portion and the second portion of 
the native promoter. The nucleic acid may, in some embodiments, further comprise a 

1 5 gene encoding a selectable marker. The regulatable fusion promoter may be any of the 
fusion promoters described herein. In some embodiments, the 5' portion comprises at 
least 60 nucleotides homologous to the first portion of the native promoter and the 3' 
portion comprises at least 60 nucleotides homologous to the second portion of the 
native promoter. Certain embodiments include those in which the nucleic acid is 

20 linear. In some embodiments, the step of identifying cells comprises identifying cells 
which proliferate to a greater extent in the presence of inducer than in the absence of 
inducer. In other embodiments, the Shine Delgarno sequence of said native promoter 
is not replaced. 

FIG. 3A illustrates one embodiment of the present promoter replacement 
25 method in which a linear double stranded promoter replacement construct comprising 
a nucleic acid encoding a selectable marker, a transcriptional terminator, and a 
regulatable fusion promoter is inserted into the chromosome of a cell expressing the 
RecE and RecT gene products of the Rac prophage or gene products having analogous 
functions. FIG. 3B illustrates one embodiment of the present promoter replacement 
30 method in which a linear single stranded promoter replacement construct comprising a 
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5' portion homologous to a target sequence, a 3' portion homologous to a target 
sequence and a regulatable fusion promoter is inserted into the chromosome of a cell 
expressing the X Beta and Gam proteins. 

As discussed below, the promoter replacement method may also be used to 
5 obtain cells in which the level or activity of a gene product required for proliferation is 
reduced. Such cells may be used in cell-based assays to identify compounds which 
inhibit proliferation. 

In another embodiment of the present invention, genes required for 
proliferation are identified by methods in which a an operator is integrated into the 
10 promoter of a target gene or into the target gene itself via homologous recombination. 
The operator may be any nucleic acid which allows the activity or level of the gene 
product encoded by the target gene to be modulated. For example, in some 
embodiments, the operator may be recognized by a repressor. In such methods, a 
linear operator insertion construct which comprises a 5* portion homologous to a first 
15 portion of the target promoter or gene, a 3' portion homologous to a second portion of 
the target promoter or gene and an operator disposed between the 5* portion and the 3' 
portion is introduced into a cell containing the target gene. After integration of the 
operator, expression of the target gene is under the control of the operator. For 
example, in some embodiments, the operator may be recognized by a transcriptional 
20 repressor. Numerous operators recognized by transcriptional repressors, such as the 
operators described herein, including the lac operator, the tet operator, the xyl 
operator, Xcl operator, the mal operator, or the trp operator, may be used. 

After insertion of the operator in the target promoter or gene via homologous 
recombination, the extent of proliferation of cells in which the operator has been 
25 inserted under conditions in which the promoter is active at a first level is compared to 
the extent of proliferation of the cells under conditions in which the promoter is active 
at a second level which is lower than the first level. If the candidate gene is required 
for proliferation, the extent of proliferation of the cells grown under conditions where 
the promoter is active at the higher level will be greater than the extent of proliferation 
30 of cells grown under conditions where the promoter is active at the lower level. 
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The operator may be integrated at any location in which the activity or level of 
the gene product encoded by the target gene is modulated by the operator. For 
example, the operator may be integrated at any location in which expression of the 
target gene will be regulated by the effector which acts on the operator. For example, 

5 if the operator is integrated into a bacterial cell, the operator may be placed upstream 
of the -35 box, between the -35 box and -10 box, or downstream of the -10 box 
(e.g.overlapping the transcription start site). In some embodiments, the operator is 
integrated so as to maintain the appropriate relationships on the DNA helix between 
the transcription factors which direct expression from the promoter which is placed 

1 0 under the control of the operator. 

As discussed above, preferably the cells into which the operator is to be 
integrated have an enhanced frequency of homologous recombination. In some 
embodiments, the operator may be introduced into the cell as linear double stranded 
DNA. Such DNA may be generated by an amplification reaction, such as a PCR 

15 reaction. In some aspects of this embodiment, the PCR reaction is performed on a 
linearized plasmid. In such embodiments, the cells may constitutively or conditionally 
express the X Red recombinase system, comprising the Gam, Exo, and Bet proteins or 
an analogous system from another phage. Alternatively, the cells may comprise the 
recBCD and sbcA mutations, recBC and sbcA mutations or analogous mutations. In 

20 some embodiments, the cells constitutively or conditionally express the recE and recT 
proteins or proteins with analogous functions. In some embodiments, the cells 
comprise the recBCD and sbcA mutations, recBC and sbcA mutations, or analogous 
mutations, and constitutively or conditionally express the recE and recT proteins, or 
analogous proteins. 

25 In some embodiments, the operator may be introduced into the cells as linear 

single stranded DNA. Such DNA may be synthesized in vitro or generated by 
sonication of a circular single stranded nucleic acid such as an Ml 3 construct 
comprising the operator insertion construct In some embodiments, the cells may 
constitutively or conditionally express the Bet protein of the k Red recombinase 

30 system or an analogous protein from another phage. Li still further embodiments, the 
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expression of the proteins in the X Red recombinase system, the Bet protein from the X 
Red recombinase system, or analogous proteins from other phage may be conditional. 

In some embodiments, the cells may be any of the cells described above which 
have an increased frequency of homologous recombination or any other cells having 
5 an increased frequency of homologous recombination which are familiar to those of 
skill in the art. 

In some embodiments, the 5* and 3' portions of the linear operator insertion 
construct which have homology to the promoter region of the target gene or the target 
gene may have any length which permits homologous recombination to occur. 

10 Accordingly, in some embodiments, the 5' and 3' regions of the construct which have 
homology to the promoter region of the target gene or the target gene, may each be 
more than 120 nucleotides in length, at least 120 nucleotides in length, at least 100 
nucleotides in length, at least 90 nucleotides in length, at least 80 nucleotides in 
length, at least 70 nucleotides in length, at least 60 nucleotides in length, at least 50 

15 nucleotides in length, at least 40 nucleotides in length, at least 30 nucleotides in length 
or less than 30 nucleotides in length. For example, in some embodiments, the 5' and 
3' regions of the construct which have homology to the target promoter or gene may 
be about 30 nucleotides in length. 

Alternatively, in some embodiments, the operator insertion construct may be a 

20 single stranded nucleic acid, such as a synthetic oligonucleotide. Preferably, the single 
stranded nucleic acid is introduced into cells constitutively or conditionally expressing 
a protein which stabilizes single stranded DNA, such as the X Bet protein or an 
analogous protein, as described in Ellis et aL, PNAS 98: 6742-6746, 2001, the 
disclosure of which is incorporated herein by reference in its entirety. After 

25 introduction of the operator insertion construct into a suitable cell, cells in which the 
operator has integrated by homologous recombination are identified. The cells may be 
identified by performing an amplification reaction or Southern blot. Alternatively, to 
identify cells in which expression of a gene required for proliferation is regulated by 
the inserted operator, cells which grow well under conditions in which the promoter 

30 which is regulated by the operator is active at a higher level but not under conditions 
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in which the promoter is active at a lower level are identified. For example, cells 
which grow on media containing an inducer which increases the level of transcription 
from the promoter but not on media in which transcription is repressed may be 
identified. In some embodiments, the operator may be the lac operator and cells 

5 which grow on media containing IPTG but not on media lacking IPTG may be 
identified. In some embodiments, the operator may be the tet operator and cells which 
grow on media containing tetracycline but not on media lacking tetracycline may be 
identified. In some embodiments, the operator may be the xyl operator and cells 
which grow on media containing xylose but not on media lacking xylose may be 

10 identified. In some embodiments, the operator may be the mal operator and cells 
which grow on media containing maltose but not on media lacking maltose may be 
identified. In some embodiments, the operator may be the trp operator and cells 
which grow on media lacking tryptophan but not on media containing tryptophan may 
be identified. In some embodiments, the operator may be the Acl operator and cells 

15 which grow at the premissive temperature but not at the restrictive temperature may be 
identified. 

The operator insertion method provides a fast and inexpensive way to engineer 
directed insertions into the chromosome. In some embodiments, the operator insertion 
construct may lack a nucleic acid encoding an identifiable or selectable marker. The 

20 method allows insertions of only the desired nucleic acid sequence without the 
necessity of introducing additional nucleic acid sequences such as sequences encoding 
identifiable or selectable markers. In addition, in embodiments where the native 
promoter is placed under the control of an operator, the expression level under 
conditions in which the promoter is not repressed or activated will be the 

25 physiological level. 

The operator insertion method may be performed in any desired prokaryotic 
organism. For example, in some embodiments, the method may be practiced in Gram- 
positive bacteria. In certain embodiments, the cell to be used with any of the methods 
described herein is selected from the group consisting of Bacillus anthracis, 

30 Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
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teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faeciiim, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus. Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 

5 mutans or any species falling within the genera of any of the above species. 

In some embodiments, the operator insertion method is conducted in cells 
which have an increased frequency of homologous recombination. For example, in 
some embodiments, the method is conducted in cells which carry the sbcA and recBC 
mutations, or in cells having mutations which confer similar phenotypes. In other 

10 embodiments, the cells may constitutively or conditionally express one or more 
proteins in the k Red recombination system. For example, in some embodiments the 
cells may constitutively or conditionally express the k Gam, Bet, and Exo proteins or 
other proteins having similar functions. In other embodiments, the cells may 
constitutively or conditionally express the k Bet protein or a protein having a similar 

15 function. 

In some embodiments, the promoter region for a target gene/operon is 
identified either experimentally or bioinformatically. An oligonucleotide is designed 
which matches at least 90, at least 70, at least 60, or at least 50 residues of the 
promoter region. An operator is engineered to be present in the middle of the 

20 oligonucleotide sequence. Thus, the operator will have at least 45, at least 35, at least 
30, or at least 25 nucleotides of homology to the target region on each side. Examples 
of operators include the lac, tet y xyl, mal, trp,ox Xcl operators, though any other 
operator known in the art can function. The oligonucleotide can be designed such that 
the operator is inserted: i) upstream of the -35 box, ii) in between the -35 and the -10 

25 box, or iii) downstream of the -10 box. 

In some embodiments, the oligonucleotides are introduced into cells that 
conditionally or constitutively express the k Red (or homologous phage) 
recombination system, which consists of the k Gam, Exo, and Bet proteins (Ellis et 
al., PNAS 98:6742-6746 (2001)). This recombination system efficiently recombines 

30 oligonucleotide DNA into the bacterial chromosome. Specifically, in some 
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embodiments, the cells constitutively or conditionally express the X Bet protein, which 
is a single strand binding protein that promotes annealing of complementary single 
strand and can mediate strand exchange, or another protein with an analogous 
function. Transformants are then plated on non-selective plates. To ascertain if a 

5 transforming is a recombinant, it is replica plated on plates that contain and lack the 
appropriate inducer. The plates are screened for an expected phenotype. For example, 
if the xyl operator is inserted downstream of the -1 0 box of an essential gene, then one 
would predict that recombinants would only be able to grow in the presence of inducer 
(xylose). For genes of known or expected phenotypes, e.g., those resulting in growth, 

10 transfoimants could be rapidly screened by plating on inducing (de-repressing) plates 
and then replica plating transformants on plates containing or lacking inducer. Those 
transformants that do not grow in the absence of inducer would contain the desired 
operator insertion. 

Recombinants could also be verified by Southern or PCR analysis. This 

15 method is amenable to high throughput and thus many targeted recombinants could be 
constructed in a short time. 

The operator insertion method also has the advantage of being portable to 
other bacteria by conditionally inducing the expression of the X Bet protein. 
Additionally, this method could be employed to test if a gene is essential by screening 

20 for conditional growth, as discussed herein. 

In some embodiments, the operator insertion method is conducted in cells 
which have an increased frequency of homologous recombination. For example, in 
some embodiments, the method is conducted in cells which carry the sbcA and recBC 
mutations, or in cells having mutations which confer similar phenotypes. In other 

25 embodiments, the cells may constitutively or conditionally express one or more 
proteins in the X Red recombination system. For example, in some embodiments the 
cells may constitutively or conditionally express the X Gam, Bet, and Exo proteins or 
other proteins having similar functions. In other embodiments, the cells may 
constitutively or conditionally express die X Bet protein or a protein having a similar 

30 function. 
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In some embodiments, a double stranded operator insertion construct is 
introduced into a cell comprising the recBCD and sbcA mutations, recBC and sbcA 
mutations or mutations with analogous consequences. The double stranded operator 
insertion construct may or may not comprise a gene encoding a selectable or 

5 identifiable marker, but in preferred embodiments the double stranded operator 
insertion construct includes a gene enocding a selectable or identifiable marker. In 
other embodiments, a double stranded operator insertion construct is introduced into a 
cell constitutively or conditionally expressing the recE and recT proteins or proteins 
with analogous functions. In still other embodiments, a double stranded operator 

10 insertion construct is introduced into a cell constitutively or conditionally expressing 
the X Gam, Bet, and Exo proteins or other proteins having similar functions. In such 
embodiments, cells in which the operator insertion construct has integrated are 
identified by obtaining colonies which express the identifiable marker or by growing 
the cells in appropriate selective media. If desired, homologous integration may be 

15 confirmed by performing a Southern blot or amplification reaction using a probe or 
primers which will yield a known result if homologous recombination has occurred. 

In other embodiments, a single stranded operator insertion construct is 
introduced into the cell. Preferably, the single stranded operator insertion construct 
does not comprise a gene which encodes an identifiable or selectable marker, but in 

20 some embodiments a gene which encodes an identifiable or selectable marker may be 
included in the construct. In some embodiments, the single stranded nucleic acid may 
be generated by in vitro synthesis. In other embodiments, a circular single stranded 
DNA, such as Ml 3, comprising the operator insertion construct may be fragmented by 
sonication and the fragments may be introduced into the cell. In still other 

25 embodiments, a single stranded operator insertion construct is introduced into a cell 
constitutively or conditionally expressing the X Bet protein, the Bet and Gam proteins, 
the Bet, Gam and Exo proteins or other proteins having similar functions^ Cells in 
which integration has occurred at the desired location are identified by performing 
colony hybridization, amplification reactions, or other methods familiar to those 

30 skilled in the art. 
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FIG. 4A illustrates one embodiment of the operator insertion method in which 
a linear double stranded regulatory element insertion construct comprising a nucleic 
acid encoding a selectable marker, a transcriptional terminator, and an operator is 
inserted into the chromosome of cells expressing the RecE and RecT gene products of 
5 the Rac prophage or gene products having analogous functions. FIG. 4B illustrates 
one embodiment of the operator insertion method in which a linear single stranded 
regulatory element insertion construct comprising a 5* portion homologous to a target 
sequence, a 3' portion homologous to a target sequence and an operator is inserted 
into the chromosome of a cell expressing the X Beta and Gam proteins. 

10 As discussed below, the operator insertion method may also be used to obtain 

cells in which the level or activity of a gene product required for proliferation is 
reduced. Such cells may be used in cell-based assays to identify compounds which 
inhibit proliferation. 

In still another embodiment, a vector containing an outward-directed inducible 

15 promoter is randomly integrated into the genome of bacteria and clones containing 
these promoters that exhibit a reduction in viability and/or cell growth are identified. 
This technique is referred to as Random Insertional Cis-Antisense (random promoter 
insertion). This strategy involves the use of constructs that contain an inducible 
promoter (e.g., one of the promoters described herein) and a selectable marker to 

20 randomly integrate into the genome of a bacteria. When such constructs are integrated 
into or near an essential gene, induction of transcription from the inducible promoter 
will frequently reduce or eliminate normal expression of the essential gene. This 
event is particularly likely if the vector integrates such that transcription from the 
inducible promoter is in an antisense orientation relative to normal transcription of the 

25 endogenous gene. The integration in such cases is likely to be 3' to the gene, within 
the 3' untranslated region of the gene, within the coding region of the gene, or in the 
case of eukaryotes, within an intron of the gene. Induction of transcription in such 
integrants will dramatically interfere with normal sense transcription of the gene. 

Since the expression of essential genes is required for proliferation of the 

30 organism, when such integration events occur within or near an essential gene, 
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induction of transcription will be lethal or will severely impair proliferation of the 
organism. Such conditional lethal or conditional growth clones can be readily isolated 
and the genes into which the vector has integrated can be determined by the cloning 
. and sequencing techniques described herein. In this manner, the random promoter 

5 insertion method can be used to discover essential bacterial genes. 

Dominant negative mutants are another class of conditional lethal mutants that 
can be identified by using the random promoter insertion technique. These mutants 
can be generated when the vector integrates such that transcription from the inducible 
promoter is in the sense orientation relative to normal transcription of the endogenous 

10 gene. In this case the integration site can be 5' to the gene, within the 5' untranslated 
region of the gene, within the coding region of the gene, or in the case or eukaryotes, 
within an intron of the gene. Because the dominant negative mutants will involve 
certain classes of essential genes, the dominant negative mutants are useful for 
identifying a subset of essential genes. Such conditional dominant negative mutants 

15 can also be employed in a cell-based assay. 

By one approach, a vector having the outward-directed inducible promoter is 
linearized by cutting with a restriction endonuclease and the linearized vector is 
transformed into bacteria. Sites of integration will be random but in many cases, the 
inducible promoter is oriented such that it produces an antisense RNA. Transformants 

20 are robotically picked, grown, and gridded onto inducing and non-inducing media 
(e.g., media having and lacking xylose, media having and lacking tetracycline, and 
media having and lacking maltose). The gridded plates are then screened for 
transformants that exhibit a growth defect on inducing media relative to the phenotype 
on non-inducing media. Transformants that have a reduced viability and/or cell 

25 growth in the presence of the inducer harbor a promoter that has integrated proximal 
to an essential gene. Once identified, the clone is sequenced to determine the identity 
of the essential gene. The genes that are affected by the random promoter insertion 
either directly or indirectly (e.g., by interacting with the RNA or protein that 
corresponds to the gene) causing a reduction in viability and/or cell growth are used as 

30 drug discovery targets for high throughput drug screening. 
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In some embodiments of the present invention, complementary to genes required for 
the proliferation of a host organism from which the antisense molecules were 
originally obtained may be used to identify homologous antisense nucleic acids and 
the coding nucleic acids complementary thereto from cells or microorganisms other 
5 than the original host organism, to inhibit the proliferation of cells or microorganisms 
other than the original host organism by inhibiting the activity or reducing the amount 
of the identified homologous coding nucleic acid or homologous polypeptide in the 
cell or microorganism other than the original host organism, or to identify compounds 
which inhibit the growth of cells or microorganisms other than the original host 

10 organism as described below. For example, antisense nucleic acids complementary to 
proliferation-required genes from original host organism may be used to identify 
compounds which inhibit the growth of Gram-positive organisms such as Bacillus 
anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, 
Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus 

15 faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, 
Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, 
Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, 
Streptococcus mutans or any species falling within the genera of any of the above 
species. 

20 In one embodiment of the present invention, antisense nucleic acids complementary to 
the sequences identified as required for proliferation or portions thereof are operably 
linked to a regulatable fusion promoter, such as one of the fusion promoters described 
herein, contained in a vectors capable of replicating within a species other than the 
species from which the proliferation-required sequences were obtained. Expression 

25 vectors which contain a fusion promoter and one or more origins of replication that 
are functional in one or more organisms, such as the expression vectors described 
herein, may be used. For example, the vector may be capable of replicating and/or 
producing transcripts in one or more of the following Gram-positive organisms 
selected from the group consisting of Bacillus anthracis, Clostridium botulinum, 

30 Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium 
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diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
5 within the genera of any of the above species. 

Following the subcloning of the antisense nucleic acids, which are 
complementary to proliferation-required sequences or portions thereof from the host 
organism from which the stabilized antisense nucleic acids were originally obtained, 
into an expression vector under control of a regulatable fusion promoter capable of 

10 replicating and producing stabilized transcripts in a second cell or microorganism of 
interest (i.e. a cell or microorganism other than the one from which the identified 
nucleic acids were obtained), the antisense nucleic acids are conditionally transcribed 
to test for bacterial growth inhibition. The coding sequences complementary to the 
antisense nucleic acids from the original host that, when transcribed, inhibit growth of 

15 the second cell or microorganism are compared to the known genomic sequence of the 
second cell or microorganism to identify the homologous gene from the second 
organism. If the homologous sequence from the second cell or microorganism is not 
known, it may be identified and isolated by hybridization to the proliferation-required 
sequence of interest from the original host organism or by amplification using PCR 

20 primers based on the proliferation-required nucleotide sequence of interest. In this way, 
nucleotide sequences which may be required for the proliferation of the second cell or 
microorganism may be identified. For example, the second microorganism may be 
selected from the group consisting of Bacillus anthracis, Clostridium botulinum, 
Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium 

25 diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 
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The homologous nucleic acid sequences from the second cell or 
microorganism which are identified as described above may then be operably linked to 
a promoter, such as one of the regulatable fusion promoter described herein, in an 
antisense orientation and introduced into the second cell or microorganism. The 
5 techniques described herein for identifying genes required for proliferation may thus be 
employed to determine whether the identified nucleotide sequences from a second cell 
or microorganism inhibit the proliferation of the second cell or microorganism. For 
example, the second microorganism may be selected from the group consisting of 
Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium 

10 perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, 
Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium 
leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, 
Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, 
Streptococcus mutans or any species falling within the genera of any of the above 

15 species. If the stabilized antisense nucleic acid inhibits the proliferation of the second 
cell or microorganism, a cell-based assay, such as the one described herein, can be used 
to test and identify candidate antibiotic compounds. 

In another embodiment of the present invention, screening of candidate 
antibiotic compounds can be performed directly by using the antisense molecule isolated 

20 from the original host organism. In this embodiment, an antisense nucleic acid 
comprising a nucleic acid complementary to the proliferation-required sequences from 
the original host organism or a portion thereof is transcribed in the second host from a 
regulatable fusion promoter such as one of the fusion promoters described herein. If the 
antisense molecule is transcribed so as to sufficiently alter the level or activity of a 

25 nucleic acid required for proliferation of the second host, the second host may be used 
directly in a cell-based assay, such as those described herein, to identify candidate 
antibiotic compounds. 

The techniques for identifying drug discovery targets described above can be 
used to generate bacterial strains for high throughput drug screening methodologies. 

30 Because the promoters described herein can be tightly and/or finely regulated over a 
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wide range of maximal and basal transcription levels, one can selectively synthesize a 
precise amount of transcript or peptide sufficient to sensitize the cells without 
achieving complete lysis of cell population. For example, bacterial strains comprising 
the fusion promoters described herein are created using the EGI, promoter 
5 replacement, operator insertion, and random promoter insertion techniques. By 
carefully adjusting the amount of inducer in the growth media, an optimal balance of 
slowed growth and sensitivity to compounds can be achieved. Libraries of test 
compounds are then screened against the sensitized cells, the effect on cell growth is 
monitored, and compounds that further reduce viability and/or cell growth are 

10 identified. Through subsequent rounds of drug design, medicinal chemistry, and 
analysis, optimal antibiotics are developed using this inventive method. 

Other embodiments concern an in vivo system for determining whether a gene 
is required for infection of a host by a bacterium. Such a system can also be used for 
validating the drug discovery targets and lead compounds identified by the approaches 

15 described above. Accordingly, bacterial strains (e.g., including bacterial strains such 
as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium 
perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, 
Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium 
leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, 

20 Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, 
Streptococcus mutans or any species falling within the genera of any of the above 
species) having essential genes under the control of the regulatable promoters 
described herein are created using the EGI technology, promoter replacement 
technology, operator insertion technology or the random promoter insertion 

25 technology and these bacteria are used to inoculate a suitable host animal (e.g., 
neutropenic mice). Two hours after inoculation, an experimental group of animals are 
either provided a solution containing inducer (if the EGI technology or random 
promoter insertion technology was used to create the compromised cells) or a solution 
containing a minimal amount of inducer (if the promoter replacement technology or 

30 operator insertion technology was used to create the compromised cells). 
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Accordingly, the bacteria are induced to generate the transcript or peptide that 
interacts with the essential gene or gene product or transcription of an endogenous 
essential gene is reduced. Subsequently, the amount of viable bacteria remaining in 
the animal or tissue sample from the animal is determined. By using this approach, 
5 one can rapidly evaluate bacterial strains carrying essential genes that can be 
selectively downregulated in vivo. Additionally, one can use these animal models to 
screen lead compounds in vivo so as to identify new antibiotics. 



10 EXAMPLE 1 

Construction of Fusion Promoters Having a Single Operator 
This example describes the construction of the fusion promters that can be 
used to drive transcription of an RNA and/or expression of a peptide or protein in 
bacteria in a tightly regulated fashion. Many of the promoters described herein can 

15 function in multiple bacterial species including Gram-positive organisms such as 
Staphylococcus and Enterococcus species. 

For construction of the Xyl-T5 fusion promoter, which is functional, for 
example, in Staphylcoccus aureus, a S. aureuslE. coli shuttle vector, referred to as 
pRNLX2, was initially constructed. First, the multiple cloning site was removed from 

20 the 5. aureus vector pRN5548, obtained from Richard Novick at the Skirball Institute 
at New York University, by digesting the vector with Hindlll The remaining vector 
sequence was self-ligated, which resulted in the formation of a new vector referred to 
as pRNHm. Next, the pRNHm vector was digested with Sail and the DNA was 
treated with Calf intestine alkaline phosphatase (CIP). Subsequently, the Sail 

25 linearized and CIP-treated pRNHffl vector was ligated to a 2.6 kb XhoVSall fragment 
of pLexSBA, which contains the plasmid origin and ampicillin resistance marker, 
piederich, L.e/ al, BioTechniques 16: 916-923 (1994), the disclosure of which is 
incorporated herein by reference in its entirety). This vector, referred to as pRNLX2, 
contains both a S. aureus and an E. coli origin of replication. Thus, the vector can 

30 replicate in both Gram-negative and Gram-positive bacteria. 
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A xyl expression system was then constructed using the pRNLX2 shuttle 
vector. First, the xyl repressor gene and promoter were obtained from the plasmid 
pWH942 using PCR with the flanking primers 5 TGGCATTCTACGACTATAAC 3' 
(SEQ ID No.: 3) and 5 ' GCG AATTCT AC ATTTT AGTTGGTT AATTT 3' (SEQ ID 
5 No.: 4). (Schnappinger, D. et al. 9 FEMS Microbiology Letters 129: 121-128 (1995), 
the disclosure of which is incorporated herein by reference in its entirety). These 
primers contain EcoRJ sites near their 5' ends and result in a PCR amplicon with 
trimmable EcoRI sites. The amplicons were digested with EcoRJ and, subsequently, 
ligated into the pRN5548 vector, which had been linearized by digestion with EcoRI 
10 and Swal. The resulting plasmid, called pXylRN, was used as a source for the xyl 
fragment for the final construct. The xyl fragment was excised from pXylRN with 
EcoRI and PshAI and was ligated to pRNLX2 that had been linearized with EcoRI and 
PvuII. The structure of the resultant plasmid was confirmed by sequencing the insert 
from both sides. 

15 In the final step, the xyl promoter from pKLO20 was removed and replaced by 

the Xyl-T5 promoter. The xyl promoter and operator of pKLO20 is flanked by an 
RsrII site at its 5 T end and the EcoRI sito of the multiple cloning site at its 3* end. The 
promoter and operator were removed by digesting the plasmid with RsrII and EcoRI* 
The larger vector fragment of this digestion was purified from the smaller promoter 

20 fragment by preparative gel electrophoresis and a pair, of oligonucleotides were 
designed having the following sequences: 

5 ' GACCGTC AT AAAAAATTT ATTTGCTTTC^ 

G ATTCAAGTTAGTTTGTTT ATTAAATTAACCAACTAAAATGT AG3 * (SEQ ID 
25 No.: 5) 

5 77UTACATTTTAGTTGGTTAATTTAATAAACAAACTAACT 
TATACAGAAAAATTTTCCTGAAAGCAAATAAATTTT^ (SEQ ID 

No.: 6). 



30 



WO 02/051982 



PCTAJS01/50250 



These oligonucleotides were annealed and, thereby, formed a double-stranded 
fragment This fragment included: the Rsrll sticky end (underlined), an AT rich 
region, the T5 promoter, the xylO, and the EcoRI sticky end (italicized). This 
fragment was then combined with the purified RsrII-EcoRI vector fragment of 
5 pKLO20 and ligated to form the pXyl-T5 vector. 

To facilitate the construction of fusion promoters that function inEnterococcus 
faecalis, a reporter plasmid having a swapable promoter module located between a xyl 
repressor gene (xylR) and xyl operator (xylO) was designed. The Lactococcus lactis 
promoter probe vector pAK80 (described in Israelsen, et al. Appl Environ, Microbiol 

10 61, 2540-47 (1995), the disclosure of which is incorporated by reference in its 
entirety), which contains replicons functional in both Escherichia coli (pl5A origin) 
and Enter ococcus faecalis (PCT1138 replicon) as well as a selectable marker for 
erythromycin resistance (erm), was digested at the unique Hindm and BamHI 
polylinker sites just upstream from the promoterless Leuconostoc mesenteroides P- 

15 galactosidase reporter genes {lacL-lacM). After digestion, the 11 kb vetor fragment 
was purified from the 26 bp polylinker by preparative gel electrophoresis then ligated 
to a 1535 bp gel purified Hindm/BamHI fragment from pEP25 (SEQ ID NO: 7) which 
contains the CP25 promoter from Lactococcus lactis (SEQ ID NO.: 38) flanked 
upstream by the xylose repressor gene (xylR) and downstream by the xylose operator 

20 sequence (xylO). A portion of the resulting ligation mixture was transformed into 
competent Escherichia coli XL-1 Blue cells (Stratagene, La Jolla, CA) and 20 jil, 75 
\il or 100 |il of the transformation mixture was plated on BYGT medium containing 
20 ng/ml erythromycin. Isolated erythromycin-resistant transformants were picked 
and streaked to obtain single colony isolates. Plasmid DNA was then purified from 

25 representative single colony transformants. The presence of the chimeric plasmid 
pEPEFl (SEQ ID NO.: 8) was confirmed by digesting purified plasmid preparations 
with HindUI/BamHI and detecting an excised 1535 bp fragment, which corresponds to 
the fragment containing the Xyl-CP25 fusion promoter, by gel electrophoresis. 

FIG. 2 shows a region of pEPEFl that contains the Xyl-CP25 fusion promoter. 

30 Immediately adjacent to the 5' end of the CP25 promoter and just downstream of xylR 
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lies a unique Rsrll recognition site (element (a) in FIG. 2). A unique recognition site 
for Xhol (element (b) in FIG. 2) lies immediately adjacent to the 3* end of the CP25 
promoter just upstream from xylO. The placement of these two unique restriction sites 
allow the CP25 promoter to be removed from pEPEFl and replaced with promoter 
5 sequences from Lactococcus or other microbial species. For example, the CP25 
promoter can be replaced with promoters having SEQ ID NOs.: 36 - 45. 

In one example, a pEPEFl derivative was created by replacing the CP25 
promoter with the P59 promoter from Lactococcus lactis. Two complementary DNA 
oligonucleotides corresponding to the sequence of the P59 promoter were synthesized 
10 (SEQ ID NOs.: 9 and 10). Each sequence included a 5'-overhang end complementary 
to either an RsrH or Xhol overhang. 

S'GACCGACATTAAATTCTTGACACKjGAGAGATAGGTTTGATAGAATATAA 
TAGTTGTC3' (SEQ ID NO.: 9) 

15 

5 ' GAC AACTATTATATTCTATC AAACCT ATCTCTCCCTGTC AAG AATT 
TAATGTCG3' (SEQ ID NO.: 10) 

The underlined nucleotides comprise the 5'-overhang of the Rsrll recognition 
20 site whereas the italicized nucleotides correspond to the 5*-overhang of the Xhol 
recognition site. The double stranded P59 RsrII/XhoI fragment was constructed by 
combining an equimolar amount of SEQ ID NOs.: 9 and 10, heating the mixture to 
90°C to denature any secondary structure, and allowing the complementary sequences 
to anneal by slowly cooling the solution to room temperature. The double stranded 
25 P59 promoter sequence was then ligated into gel-purified, RsrD/XhoI-digested 
pEPEFl reporter vector thereby creating the Xyl-P59 fusion promoter. This reporter 
vector was termed pEPEFl-P59. 

It will be appreciated that any promoter sequence, including the promoters of 
SEQ ID NOs.: 36 - 45, can be synthesized to include an Rsrll overhang at one end and 
30 a Xhol overhang at the other. Promoter sequences can be wildtype promoters known 
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from Lactococcus lactis and other Gram-positive species or promoters that have been 
modified to alter their transcription efficiency in gram-positive organisms. Example 7 
provides a description of promoters that have been modified to alter their transcription 
efficiency in Staphylococcus aureus or Enterococcus faecalis. 
5 The next example describes the construction of fusion promoters that have 

multiple operators. 

EXAMPLE 2 

Construction of Fusion Promoters Having a Plurality of Operators 

10 This example describes the construction of fusion promoters comprising a 

plurality of operators. Fusion promoters such as Xyl/Lac-T5 and Xyl/Lac-P59 can be 
constructed by using a cloning strategy that is similar to the promoter swaping 
procedure described in Example 1. It will be appreciated that this procedure may be 
used to generate fusion promoters having two or more operators selected from the 

15 group consisting of xylO, tet0 9 trpO, malO, XclO and lacO. 

The lac operon is contained in many vectors including pGEM-HZf(-), which 
is commercially available from Promega Corp. The sequence of the pGEM-HZf(-) 
vector can be found in the NCBI database, Accession number X65314, the disclosure 
of which is incorporated herein by reference in its entirety. The lac operator 

20 comprises the 17bp sequence: (TTGTTATCCGCTCACAA) (SEQ ID No. 11), which 
can be inserted in place of the 17 bp intervening sequence between the -35 Box and - 
10 Box of a variety of promoters, including the promoters of SEQ ID NOs.: 26 - 35. 

In one example, a Xyl/Lac-T5 promoter can be made by replacing the 17 bp 
sequence, which comprises element (3) in FIG. 1, with the 17 bp lac operator 

25 sequence. To make the Xyl/Lac-T5 fusion promoter, two oligonucleotides are 
synthesized and annealed to create a double stranded fragment. The two 
oligonucleotides are provided below: 

5 'GACCGTCATAAAAAATITATTTGCTTTGTGAGCGGATAAC AATTATA 
30 ATAG ATTC AAGTTAGTTTGTTTATTAAATTAACCAACTAAAATGTAG3 ' 
(SEQ ID No.: 12) 
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5 J AA 77CTACATTTT AGTTGGTT AA^ 

ATTATAATTGTTATCCGCTCAC AAAGC AAATAAATTTTTTATGACG3 ' 
(SEQ ID No.: 13). 

5 

The resulting double stranded fragment includes: the RsrII sticky end 
(underlined portion of SEQ ID NO.: 12), an AT rich region found to increase the level 
of transcription from the T5 promoter in Gram-positive bacteria, the T5 promoter, the 
lac operator (lacO), the xylose operator (xy/O), and the EcoRJ sticky end (italicized 
10 portion of SEQ ID NO.: 13). This double standed fragment can be combined with the 
purified Rsrll/EcoRI vector fragment of pKLO20 and ligated to form the vector 
pXyl/LacT5. 

In a similar example, a Xyl/Lac-P59 promoter can be constructed by replacing 
the 17 bp sequence between the -35 Box and -10 Box regions (nucleotides 18-34) of 
15 SEQ ID NO.: 40) of P59 with the 17 bp lac operator sequence (SEQ ID NO.: 1 1). To 
make the Xyl/Lac-P59 fusion promoter, two oligonucleotides are synthesized and 
annealed to create a double stranded fragment. The two oligonucleotides are provided 
below: 

20 S'G^CGACATTAAATTCTTGACATTGTGAGCGGATAACAATAGAATATAA 
TAGTTGTC3' (SEQ ID NO.: 14) 

5TCG^GACAACTATTATATTCTATTGTTATCCGCTCACAATGTCAAGAATT 
TAATGTCG-3' (SEQ ID NO.: 15). 

25 

The underlined nucleotides comprise the S'-overhang of the RsrII recognition 
site whereas the italicized nucleotides correspond to the 5'-overhang of the Xhol 
recognition site. The double stranded P59/Lac fusion promoter formed by the 
annealing of these two oligonucleotides is then ligated with purified, Rsrll/Xhol- 
30 digested pEPEFl, which contains the xylO operator, to form the vector pEPEFl- 
Lac/P59. 
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It will be appreciated that there are many equivalent techniques that can be 
used to construct fusion promoters having a plurality of operators. It will also be 
appreciated that operators other than lacO and promoters other than those described 
herein may be used in multi-operator constructs. For example, the fusion promoters 
5 may comprise a plurality of operators selected from the group consisting of xylO y tetO, 
trpO, malO y Ac 10 or lacO. Furthermore, one of ordinary skill in the art will recognize 
that an operator may be positioned in a various locations with respect to the promoter 
provided that the operator retains sufficient ability to regulate transcription. 

The next example describes the construction of expression vectors containing 
1 0 regulatable fusion promoters. 

EXAMPLE 3 

Multipurpose Expression Vectors Containing Fusion Promoters 
In this section, the construction and use of expression vectors comprising 

15 fusion promoters is described. Because these expression vectors can accept a wide 
range of fusion promoters each having a range of transcriptional activity, these vectors 
are useful for finely modulating the amount of nucleic acid or polypeptide expressed 
inside a host cell throughout a wide range of concentrations. 

It will be appreciated that a regulatable fusion promoter suite for use in 

20 expression vectors can be constructed which provides a wide dynamic range of both 
basal and maximal transcriptional activities. Each fusion promoter within this suite 
can be tightly and finely regulated over its own characteristic range of transcriptional 
activity. Furthermore, the regulatable fusion promoters within the suite can comprise 
different promoters operably linked to the same operators thus providing for a 

25 common means of regulation of the promoters within the suite. For example, the 
promoters within the set of regulatable fusion promoters within the suite can be linked 
to at least one operator contained in a promoterless expression vector that is capable of 
replicating in one or more Gram-positive organisms. In other embodiments, a 
promoter operator fusion construct may be inserted into the vector as a cassette. 

30 Accordingly, a regulatable fusion promoter having an appropriate basal level and/or 
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maximal level of transcriptional activity can be selected from the suite for use in the 
appropriate application. As described below, fusion promoters can be quickly and 
easily exchanged in the expression vector as needed. 

Fusion promoters prepared as described in the preceding sections can be 
5 incorporated into existing expression vectors using conventional techniques in 
molecular biology. Embodiments include, for example, expression vectors having 
promoters of SEQ ID NOs.: 36 - 45 linked to operators such as xylO, xylOllacO or 
one or more operators including, but not limited to tetO, malO, lacO, trpO, and telO. 
In some embodiments, the expression vectors comprising the fusion promoters allow 
10 for the transcription and/or expression of a peptide protein or transcript in a plurality 
of Gram-positive and Gram-negative bacterial species (e.g., Escherichia coli, 
Staphylococcus species, and Enterococcus species). 

These vectors can also comprise any of the following elements: genes 
encoding assayable markers such as enzymes that produce detectable products, genes 
15 encoding selectable markers such as polypeptides that control drug resistance, one or 
more unique restriction sites positioned downstream of the promoter such that inserts 
cloned into the site can be transcribed from the promoter, one or more transcription 
terminators such as a rho independent or rho dependent terminators which are 
positioned downstream of the unique cloning sites so as to terminate transcription of 
20 the nucleic acid insert, an origin of replication functional in Gram-negative species 
such as Escherichia coli, and an origin of replication functional in Gram-positive 
species such as Staphylococcus or Enterococcus. 

For example, pXyl-T5 P15a is an expression vector which comprises the Xyl- 
T5 fusion promoter (see FIG. 5). This vector is 6858 bp in length and can be used to 
25 transcribe RNA and/or express a peptide or protein in Gram-positive bacteria 
including, but not limited to, Staphylococcus species. This vector comprises the Xyl- 
T5 fusion promoter positioned between the xylR gene and a multiple cloning site 
having unique restriction sites (e.g., EcoRJ, Sad, Xhol, Kpnl, Smal, Xmal, BamHI. 
Xbal. and Sail sites), the beta-lactamase gene from pLex5BA the P15a replicon 
30 (which maintains the vector at ^proximately 15 copies/cell in E. coli), and the pC194 



-87- 



WO 02/051982 



PCTAJS01/50250 



Staphylococcus replicon (which includes palA, palB, the pel 94 origin of replication, 
the rep gene, and a chloramphenicol acetyl transferase gene that imparts 
chloramphenicol resistance in Gram-positive bacteria). 

The pXyl-T5 pl5a vector was constructed from pXyl-T5 by replacing the E. 

5 coli ColEl origin of pXyl-T5, which allows replication of the vector at 100-200 copies 
per cell, with the PI 5a origin of pACYC184, which allows replication of the vector at 
approximately 15 copies per cell in E. coli. (Chang, A.C.Y. and Cohen, S.N., J. 
Bacteriol. 134:1141-1156 (1978), the disclosure of which is incorporated herein by 
reference in its entirety). The ColEl origin lies within two closely linked NotI sites. 

10 To obtain the Xyl plasmid having the ColEl origin, the pXyl-T5 vector was digested 
with NotI and the larger vector fragment was purified by preparative gel 
electrophoresis. Next, a derivative of plasmid pACYC184, which includes the P15a 
origin as a similar NotI cassette, was digested with NotI and the smaller PI 5a- 
encoding fragment was gel purified. (Diederich, L.et ah, BioTechniques 16: 916-923 

15 (1994), the disclosure of which is incorporated herein by reference in its entirety). 
The two purified NotI fragments were combined and ligated to form the vector pXyl- 
T5 P15a. Table 1 lists the components of the pXyl-T5 PI 5a vector and their locations. 
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TABLE 1 



Type 


Start 
Position 


End 

Position 


Name 


Description 


Gene 


176 


1327 


xylR 


The gene encoding repressor protein XylR 


Promoter 


1360 


1455 


A:T rich 

T5 
promoter 


The modified T5 promoter with upstream 
A:T rich box 


Region 


1455 


1494 


Multiple 
Cloning 
site 


Multiple cloning site includes following 
unique restriction sites: EcoRI, Sad, Xhol, 
Kpnl t SmaI P Xmal t BamHl Xbal, Sail 


Gene 


2903 


2043 


ampR 


The beta- lactamase gene from pLex5BA 


Region 


3037 


3852 


P15a 


A replicon that allows maintenance of 
vector at approximately 15 copies per cell 
in E. coli hosts. Inserted as NotI cassette at 
the Start and End numbers of the vector 
sequence. 


Region 


4302 


6659 




S. aureus replicon from pC194. Includes 
the five features described below: 


Region 


4462 


4302 


CpalA 


With palB y imperfect direct repeats 


Gene 


5242 


4592 


C Cat 


Gram-positive Chloramphenicol acetyl 
transferase gene-imparts chloramphenicol 
resistance in S. aureus. 


Region 


5562 


5399 


Cori 


pC194 origin of replication 


Gene 


5699 


6388 


rep 


gene for rep protein 


Region 


6659 


6626 


CpalE 





A plasmid map of pXyl-T5 PI 5a is provided in FIG. 5. The nucleotide 
5 sequence of pXyl-T5 Pl5a is provided in SEQ ID NO.: 16. 

It will be appreciated that corresponding expression vectors which include 
fusion promoters comprising either single or multiple operators can be constructed 
using conventional techniques in molecular biology. For example, pXyl/Lac-T5 pi 5a 
can be constructed by digesting pXyl-T5 pi 5a with Rsrll and EcoRI so as to remove 
10 the Xyl-T5 fusion promoter. After purification, the linear vector fragment is ligated to 
an RsrEL/EcoRI fragment comprising the pXyl/Lac-T5 promoter the conduction of 
which was described in Example 2. 

In another example, expression vectors functional in both Escherichia coli and 
Enterococcus faecalis were constructed by replacing the lacL-lacM reporter genes of 
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pEPEFl with the multiple cloning site (MCS) and rrnBtlt2 terminator of pLexP20. 
Specifically, pEPEFl was digested with Smal and Sail to remove the 3808 basepair 
fragment containing the lacL-lacM reporter genes. Following digestion, the large 
vector fragment was gel purified, the Sail overhanging end was made blunt with T4 
5 DNA polymerase, and the fragment was circularized by ligating the blunt ends. The 
resulting plasmid, termed pEPEF2, was then digested with BamHI, gel purified, and 
blunted with T4 DNA polymerase. The Clal/BstBl fragment containing the MCS and 
mnBtlt2 terminator was removed from pLexP20, gel purified, blunted using T4 DNA 
polymerase, then ligated to the blunt ended pEPEF2. A portion of the resulting 

10 ligation mixture was transformed into competent Escherichia coli and aliquots of the 
transformation mixture were plated on medium containing 20 ^ig/ml erythromycin. 
Isolated erythromycin-resistant transformants were picked and streaked to obtain 
single colony isolates. Plasmid DNA was then purified from representative single 
colony transformants. The presence and orientation of the MCS/terminator fragment 

15 was determined by PCR and DNA seqeuncing, respectively. A plasmid comprising 
the MCS/terminator fragment oriented such that the MCS was located adjacent to 
xylO was termed pEPEF3 (SEQ ID NO.: 17). 

It will be appreciated that expression vectors such as those described above 
can be used to construct derivative expression vectors which include fusion promoters 

20 comprising either single or multiple operators. For example, the CP25 promoter can 
be removed from pEPEF3 by digestion of this vector with RsrH and Xhol. After 
purification, the linear vector fragment is ligated to an RsrD/XhoI fragment 
comprising P59. A pEPEF based expression vector having a generic promoter, termed 
pEPEF-X is shown in FIG. 6. Accordingly, many expression vectors derived from 

25 either pXyl-T5 or pEPEF3 can be constructed by exchanging the promoter sequence 
contained therein with the desired promoter sequence. In some embodiments, an 
expression vector is constructed comprising a promoter selected from SEQ ID NOs.: 
36 - 45. pEPEF-X based expression vectors containing the Lactococcus lactis derived 
promoter sequences described herein, include pEPEF9 (SEQ ID NO.: 46), pEPEF14 

30 (SEQ ID NO.: 47), pEPEF14* (SEQ ID NO.: 48), P1P2/EF14 (SEQ ID NO.: 49), 
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pEPEF18 (SEQ ID NO.: 50), pEPEF20 (SEQ ID NO.: 51), and pEPEF22 (SEQ ID 
NO.: 52). 

Several methods of transcribing an RNA and/or expressing a peptide or protein 
are also embodiments included herein. More specifically, the fusion promoters 
5 described herein can be used to transcribe a sense or anti-sense RNA or can be used to 
express a recombinant peptide or protein in a cell. 

The next example describes experiments that were performed to evaluate the 
ability of the Xyl-T5 promoter to transcribe RNA. 

10 EXAMPLE 4 

Activity of Fusion Promoters by Real-Time RT-PCR 
To analyze the strength of the Xyi-T5 promoter that comprised the T5 
promoter operably linked to an AT Box and an xylO operator, transcript accumulation 
experiments were conducted using real-time RT-PCR (Reverse Transcriptase 

15 Polymerase Chain Reaction). RT-PCR detects a specific RNA by PCR amplification 
of a reverse transcriptase synthesized cDNA copy of the RNA. With real time RT- 
PCR, the accumulation of PCR amplified DNA is measured at each amplification step 
using a specific fluorogenically labeled oligonucleotide probe. The probe anneals to 
the newly formed cDNA product and is subsequently cleaved by the 5' nuclease 

20 activity of the Taq DNA Polymerase during DNA amplification. Cleavage of the 
probe results in an increase in fluorescent dye signal, which is used to monitor 
accumulation of the specific PCR product at each PCR amplification step. The PCR 
cycle time at which logarithmic amplification is reached is proportional to the initial 
amount of RNA template in the reaction. 

25 The levels of transcription from the Xyl-T5 promoter were evaluated in the 

following experiment. Fresh overnight colonies of Staphylococcus aureus RN4420 
containing plasmid a plasmid having the Xyl-T5 promoter were added to 10 ml of 
Luria Broth (LB) that was supplemented with 15 ^g/ml chloramphenicol (LB CM15). 
The cells were grown at 37°C with shaking until early log phase (OD600 between 0.1 

30 and 0.2) at which point they were diluted 1/10 into pre-wanned LB CM15. After one 
hour of growth at 37°C with shaking, three 1 ml aliquots of cells were pelleted and 
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quick frozen on dry ice. The remaining suspension was split into two flasks. Xylose 
was added to a final inducer concentration of 2.0% in one flask, whereas the second 
flask remained non-induced (control). The resulting transcript is approximately 300 
nucleotides in length and corresponds to vector sequence. 

5 Immediately after xylose induction (2 minute time point) and at 10, 20, 30 and 

60 minute time points, (three) 1 ml aliquots of cells were pelleted from each flask 
(with and without xylose) and quick frozen on dry ice. Cell pellets were stored at - 
80°C Total RNA was purified from each sample using a Qiagen 96 well RNeasy kit. 
Approximately, 1 ng of RNA (two samples for each time point) was used as template 

10 for RT-PCR with an oligonucleotide primer set (forward and reverse primers, and a 
fluorogenic probe) for specific detection of the 3' tail of the Xyl-T5 transcript. The 
reaction consisted of 300 nM each of the forward and reverse primers; 100 nM 
fluorogenic probe; 0.75 U AmpliTaq Gold (PE Biosystems); 7.5 U MultiScribe 
Reverse Transcriptase (PE Biosystems), Ix Buffer A (PE Biosystems); 5.8 mM MgCl, 

15 and 333 nM of each dNTP. The RT-PCR conditions were 48°C for 30 minutes, 95°C 
for 10 minutes, followed by 40 cycles of 95°C for 15 seconds and 60°C for 1 minute. 
RT-PCR reactions and detection were done in an ABI Prism 7700 Sequence Detection 
System. 

The results of the experiment described above are shown in FIG. 7. Within 2 
20 minutes of xylose induction, greater than a 30 fold excess of RNA was detected, as 
compared to the non-induced controls. At 60 minutes, greater than a 130 fold excess 
of RNA was detected in the xylose induced samples, as compared with non-induced 
bacteria. This example demonstrates that the fusion promoters described herein can 
be used to efficiently transcribe an RNA. Further, the data described herein verify that 
25 in the absence of xylose or an analog thereof Xyl-T5 promoter-driven transcription 
can be effectively repressed. 

It will be appreciated that the fusion promoters described above can be used to 
construct a regulatable fusion promoter suite which provides a wide dynamic range of 
both basal and maximal transcriptional activities. Each fusion promoter within this 
30 suite can be tightly and finely regulated over its own characteristic range of 
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transcriptional activity. Furthermore, the regulatable fusion promoters within the suite 
can comprise different promoters operably linked to the same operators thus providing 
for a common means of rapidly regulating the promoters within the suite. In some 
embodiments, the promoters within the set of regulatable fusion promoters within the 

5 suite can be linked to at least one operator contained in a promoterless expression 
vector that is capable of replicating in one or more Gram-positive organisms. 
Accordingly, a regulatable fusion promoter having an appropriate basal level and/or 
maximal level of transcriptional activity can be selected from the suite for use in the 
appropriate application. 

I o The example below demonstrates the relative strengths of the fusion promoters 

described herein. 

EXAMPLE 5 

Activity of Fusion Promoters by Expression of R eporter Genes 

15 This example describes the use of reporter gene constructs to measure the 

activity of fusion promoters such as those constructed in Examples 1 and 2. In one 
experiment, the pXyl-T5 and pXyl-T5 P15a vectors were tested to determine the 
relative level of expression of a reporter gene in both Escherichia coli and 
Staphylococcus aureus. To test the ability of the Xyl-T5 promoter to induce 

20 transcription and express a heterologous polypeptide, a construct was made in which 
the reporter gene was operably linked to the Xyl-T5 promoter. A pXyl-T5 vector was 
digested with Sad and Sail, and the linearized vector was purified by preparative gel 
electrophoresis. The linearized vector fragment was then ligated with a fragment 
containing the reporter gene. This construct was named pXyl-T5-Reporter. The pXyl- 

25 T5-Reporter vector containing the reporter gene under the control of the Xyl-T5 
promoter was then introduced into S. aureus and the cells were cultured in the 
presence and absence of 2.0% xylose. The transcription and translation of the gene 
product encoded by the reporter gene was evident when xylose was provided in the 
growth medium but in the absence of an agent that inhibits the binding of the xylose 

30 repressor to the xylose operator, production of the gene product encoded by the 
reporter gene was not detectable. 
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When the vector containing the reporter gene under the control of the Xyl-T5 
promoter was transformed in Escherichia coli, it was noted that expression of the gene 
product encoded by the reporter gene was very high, and uncontrollable either in the 
presence or absence of xylose. This may be because Escherichia coli does not express 

5 the XylR repressor. In those embodiments where it is desirable that Escherichia coli 
serve as a surrogate host, pXyl-T5 plSa, which has a reduced copy number in 
Escherichia coli can be used. 

To evaluate the effect of the PI 5a origin on the level of reporter gene 
expression in Staphylococcus aureus and Escherichia coli in the presence and absence 

10 of xylose, a construct similar to pXyl-T5-Reporter was made, using the pXyl-T5 pl5a 
vector. This construct was denoted pXyl-T5 pl5a-Reporter. A comparison of the 
induction of reporter gene expression of pXyl-T5-Reporter and pXyl-T5 pl5a- 
Reporter showed that the two vectors functioned identically when transformed and 
expressed in Staphylococcus aureus, but the P15a variant showed a markedly reduced 

15 expression of the reporter gene when transformed in Escherichia coll The 
demonstrates that the pi 5a origin of replication decreases reporter gene expression by 
reducing the copy number of the expression vector thereby reducing the number Xyl- 
T5 promoters in the cell. 

Fusion promoters contained on pEPEFl and its derivatives were evaluated in 

20 determine their strength in Enterococcus faecalis by measuring the P-galactosidase 
activity produced by expression of the lacL-lacM reporter genes under induced and 
uninduced conditions. In one experiment, derivatives of pEPEFl were prepared by 
replacing CP25 with the Lactococcus lactis derived promoters P32 (SEQ ID NO.: 
39), P59 (SEQ ID NO.: 40), or P1P2 (SEQ ID NO.: 42) as described in Example 1. 

25 pEPEFl and the resulting derivatives were each separately transformed into 
electrocompetent Enterococcus faecalis and the cells were cultured in the absense and 
the presence of 5% xylose. The p-galactosidase activity produced by the expression 
of the reporter genes was measured essentially as described in Israelsen et al. Appl. 
Environ. Microbiol 61:2540-47 (1995), the disclosure of which is incorporated herein 

30 by reference in its entirety, with the following modifications. After incubation with or 
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without xylose for four hours, the cells were harvested by centrifugation then 
resuspended in 1 ml of Z buffer comprising 0.06 M Na 2 HP0 4 • 7H 2 0, 0.04 M 
NaH 2 P0 4 • H 2 0, 0.01 M KC1, 0.001 M MgS0 4 ■ 7H 2 0, 0.05 M p-mercaptoethanol, at 
pH 7.0, with 0.6 ml of glass beads (150-212 micron diameter). The cells were lysed 
5 using a bead beater at maximun speed for 1 minute. After centrifugation, the 
supernatant was removed, vortexed with one drop of 0.1% SDS and 2 drops of 
chloroform at high speed for 10 seconds, then incubated with 200 nl of a 4 mg/ml o- 
nitrophenyl-P-D-galactopyranoside (ONPG) solution at 37°C. After approximately 20 
minutes, the reaction was stopped by adding 250 ^1 of 1M sodium carbonate. P- 
10 galactosidase activity was quantitated spectrophotometrically. All assays were 
performed in triplicate. 

FIG. 8 displays the amount of P-galactosidase activity produced by each fusion 
promoter in Enterococcus faecalis in the presence and absence of xylose. Promoters 
CP25 and P32 both produce very little p-galactosidase activity in the absence of 
15 xylose; however, in the presence of 5% xylose, both of these promoters express 
enough P-galactosidase to produce approximately 25 Miller units of activity. The 
activities of both P59 and P1P2 are much greater than that of either CP25 or P32. In 
the presence of xylose, P59 and P1P2 each produce between 125 and 150 Miller units 
of p-galactosidase activity. In the absence of xylose, P59 is strongly repressed having 
20 less than 4% of its induced activity. By contrast, P1P2 produces about 20% of its 
induced activity even in the absence of xylose. 

The above data shows that the fusion promoters described herein differ in the 
levels of transcription which they provide and the levels of transcription under non- 
inducing conditions. Some promoters are highly active under induced conditions 
25 whereas others show only a modest increase in activity upon induction. Furthermore, 
some promoters described herein may be strongly repressed in the absence of xylose 
whereas the activity of other promoters may only be partially inhibited. 

It will be appreciated that the fusion prompters described above can be used to 
construct a regulatable fusion promoter suite which provides a wide dynamic range of 
30 both basal and maximal transcriptional activities. Each fusion promoter within this 
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suite can be tightly and finely regulated over its own characteristic range of 
transcriptional activity. Furthermore, the reguiatable fusion promoters within the suite 
can comprise different promoters operably linked to die same operators thus providing 
for a common means of regulation of the promoters within the suite. In some 
5 embodiments, the promoters within the set of reguiatable fusion promoters within the 
suite can be linked to at least one operator contained in a promoterless expression 
vector that is capable of replicating in one or more Gram-positive organisms. 
Accordingly, a reguiatable fusion promoter having an appropriate basal level and/or 
maximal Level of transcriptional activity can be selected from the suite for use in the 
1 0 appropriate application. 

In another experiment, the activity of the CP25 fusion promter was compared 
to the activity of Xyl-T5. To produce a pXyl-T5 pi 5a vector containing the lacL- 
lacM reporter, the purified 4 kb Hindm/Sall fragment of pAK80 which contains the 
lacLAacM reporter construct was ligated into purified pXyl-T5 pi 5a which had been 
15 linearized using HindEI and Sail. The resulting constuct was termed pXyI-T5 pi 5a 
LacLM. To compare the activity of the T5 fusion promoter with that of CP25, pXyl- 
T5 pl5a LacLM and pEPEFl were transformed separately into Enterococcus faecalis 
and the cells were cultured in the presence of 1% and 5% xylose and in its absence. 
The p-galactosidase activity produced by the expression of the reporter genes was 
20 measured using the Miller assay as described above. 

FIG. 9 displays the amount of P-galactosidase activity produced by each fusion 
promoter under induced and uninduced conditions. In the absence of xylose, both the 
fusion promoters are strongly repressed in Enterococcus faecalis. When induced with 
1% xylose, CP25 has about twice the activity of Xyl-T5 in Enterococcus faecalis. At 
25 5% xylose, CP25 produces about 35 Miller units of activity compared to about 22 
Miller units by Xyl-T5. 

To determine whether there is a difference in the level of gene expression from 
pXly-T5 pl5a in Staphylococcus aureus and Enterococcus faecalis, pXyl-T5 pl5a 
LacLM was also transformed into Staphylococcus aureus which was grown in the 
30 presence of 1% and 5% xylose and in its absensce. The p-galactosidase activity 
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produced by the expression of the reporter genes was measured using the Miller assay 
as described above. 

FIG. 9 shows that the activity of Xyi-T5 is similar in Staphylococcus aureus 
and Enterococcus faecalis. When induced with 5% xylose in each organism, the Xyl- 
5 T5 fusion produces appoximately 22 Miller units of p-galactosidase activity. 

The above data shows that the fusion promoters disclosed herein have the 
ability to function in a variety of Gram-positive organisms and to maintain their 
characteristic activities when transferred from one host to another. This result is in 
constrast to the alteration in activity that is seen when the promoter is transferred to 
10 Gram-negative organisms. One example is the unregulated activity of Xyl-T5 in in 
Escherichia colL 

The example below demonstrates that the activity of the fusion promoters 
described herein can be fmely regulated over a range of inducer concentrations. In 
other words, the activity of these fusion promoters is titratable. 

15 

EXAMPLE 6 

Titratable Expression from Fusion Promoters 
This example shows that the activity of promoters described herein is titratable 
with inducer thus allowing a reproducible, incremental modulation of promoter 
20 activity by altering the concentration of inducer. Each fusion promoter described 
herein possesses its own characteristic activity over a range of xylose concentrations. 
Within this range, the amount of RNA transcribed from a fusion promoter can be 
finely regulated in repsonse to the concentration of xylose used to induce the 
promoter. 

25 In one example, the derivative of pEPEFl containing the P59 promoter 

(pEPEFl-P59) was transformed into Enterococcus faecalis and the transformants were 
cultured in the presence of increasing concentrations of xylose (from 0.1% to 10%) 
and in its absense. The P-galactosidase activity produced by the expression of the 
lacL-lacM reporter genes was measured using the Miller assay as described in 
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Example 5. FIG. 10 shows that the activity of the P59 promoter increases in response 
to induction with increasing concentrations of xylose. 

The next example describes modifications to fusion promoters that effect their 
transcriptional activity. 

5 

EXAMPLE 7 

Modifications of Fusion Promoters that Alter Transcription Efficiency 
This example describes the construction of fusion promoters that have been 
modified to have altered transcriptional activity in Gram-positive organisms. More 

10 specifically, modified fusion promoters, which are derived from wildtype promoters, 
have been constructed by creating mutations in the promoter sequence which either 
increase or diminish the strength of the promoter. 

In one example, the strong Lactococcus lactis promoter, PL (SEQ ID NO.: 44), 
was modified to increase its activity in Gram-positive organisms. First, a regulatable 

15 PL fusion promoter was constructed by linking the PL promoter to a xylose operator 
using the methods described previously. Examination of the PL sequence revealed 
that changing the C at position 45 of SEQ ID NO.: 44 to an A would make the -10 
Box region of promoter PL identical to -10 Box consensus site reported for Gram- 
positive organisms (TATAAT). PL10, a derivative of PL which incorporates this 

20 single basepair modification, was constructed by annealing the complementary 
synthetic oligonucleotides of SEQ ID NOs.: 18 and 19 using the procedure described 
in Example 1. 

5 ' GACCGTTTCGTGAACTTTTTTGTTG AC AAAG AT AAAAAC AC ATG ATATA 
25 ATT AAATC AC3 * (SEQ ID NO.: 18) 

5 * JCGi4GTG ATTT AATTATATC ATGTGTTTT^ 
CACGAAACG3' (SEQ ID NO.: 19) 
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The underlined nucleotides comprise the 5 '-overhang of the Rsrll recognition 
site whereas the italicized nucleotides correspond to the 5' -overhang of the Xhol 
recognition site. The double stranded promoter formed by the annealing of these two 
oligonucleotides was then ligated with purified, RsrII/XhoI-digested pEPEFl to form a 

5 pEPEFl derivative containing the Xly-PLIO fusion promoter. This derivative was 
termed pEPEFl-PLlO. To determine the effect of this modification, the strengths of 
PL10, PL and P59 were compared by separately transforming pEPEFl derivatives 
containing the appropriate promoter into Enterococcus faecalis then culturing the cells 
in the absense or presence of 5% xylose. The P-galactosidase activity produced by the 

10 expression of the lacLAacM reporter genes was measured using the Miller assay as 
described in Example 5. 

FIG. 11 compares the strengths of the PL, PL10 and P59 fusion promoters 
expressing the lacL-lacM reporter genes. When induced with 5% xylose, the activity 
of PL10 is about 20% greater than that of the wildtype PL promoter. Both PL and 

15 PL10 have substantially greater activity than P59 under induced conditions. In the 
absence of xylose, P59 is almost completely repressed whereas both PL and PL10 
show significant basal activity. 

In another example, P59 was modified to increase its activity in Gram-positive 
organisms. Examination of the P59 sequence revealed that changing the G at position 

20 37 of SEQ ID NO.: 40 to a T would make the -10 Box region of promoter P59 
identical to -10 Box consensus site in Gram-positive organisms. Additionally it 
appeared that addition of an AT rich region upstream of the -35 Box might enhance 
promoter activity. A derivative of P59 which incorporates these modifications was 
constructed by annealing the complementary synthetic oligonucleotides of SEQ ID 

25 NOs. : 20 and 2 1 using the procedure described in Example 1 . 

S'GACCGAAAAATGACAGTTTATTCTTGACAGGGAGAGATAGGTTTGATAT 
AATATAATAGTTGTC3' (SEQ ID NO.: 20) 
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5' TrG^GACAACTATTATATTATATCAAACCTATCTCTCCCTGTCAAGAATA 
AACTGTCATTTTTCG3' (SEQ ID NO.: 21) 

The underlined nucleotides comprise the 5*-overhang of the RsrII recognition 

5 site whereas the italicized nucleotides correspond to the 5 '-overhang of the Xhol 
recognition site. The double stranded promoter formed by the annealing of these two 
oligonucleotides was then ligated with purified, RsrII/XhoI-digested pEPEFl, 
transformed into Efaecalis, and assayed as previously described. When compared to 
the unmodified P59, the derivative had about 20% higher activity. 

10 It will be appreciated that modified fusion promoters, such as those described 

above, can be used to construct a regulatable fusion promoter suite which provides a 
wide dynamic range of both basal and maximal transcriptional activities. Modifying 
regulatable fusion promoters so as to increase their activity expands the range of both 
basal and maximal transcriptional activities of a suite of regulatable fusion promoters. 

15 Each modified fusion promoter within this suite can be tightly and finely regulated 
over its own characteristic range of transcriptional activity. Furthermore, the modified 
regulatable fusion promoters within the suite can comprise different promoters 
operably linked to the same operators thus providing for a common means of 
regulation of the promoters within the suite. In some embodiments, the modified 

20 promoters within the set of regulatable fusion promoters within the suite can be linked 
to at least one operator contained in a promoterless expression vector that is capable of 
replicating in one or more Gram-positive organisms. Accordingly, a regulatable 
fusion promoter having an appropriate basal level and/or maximal level of 
transcriptional activity can be selected from the suite for use in the appropriate 

25 application. 

In yet another example, the Xly-T5 fusion promoter was modified to decrease 
its activity in Gram-positive organisms. By changing four nucleotides in the -35 Box 
of SEQ ID NO.: 36 (G to T at postion 18, C to A at position 19, T to C at position 20 
and T to A at position 21) and one in the -10 Box of SEQ ID NO.: 36 (A to T at 
30 position 43) the Xyl-T5 promoter could be altered to include -10 and -35 elements that 
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are identical to those of the weak xylA promoter. (For a description of the xylA 
promoter, see Sizemore et aL, Mol Gen. Genet 227:377-384 (1991) and 
Schnappinger, D. et al 9 FEMS Microbiology Letters 129: 121-128 (1995), the 
disclosures of which are incorporated herein by reference in their entireties). Xyl-T5- 
5 DD, a derivative of Xyl-T5 which incorporates these modifications was constructed by 
annealing the complementary synthetic oligonucleotides of SEQ ID NOs.: 22 and 23 
using the procedure described in Example 1 . 

5'GACCGTCATAAAAAATTTATTTTACATCAGGAAAA 
10 GATTCAAGTT AGTTTGTTTATTAAAT^ ' (SEQ ID 

NO.: 22) 

5 J AA 77UT AC ATTTT AGTTGGTTAATTT^ 

ATATACAGAAAAATTTTCCTGATGTAAAATAAATTTTTTATGACG3' (SEQ 
15 ED NO.: 23) 

The underlined nucleotides comprise the 5'-overhang of the RsrII recognition 
site whereas the italicized nucleotides correspond to the 5 '-overhang of the EcoRI 
recognition site. The double stranded promoter formed by the annealing of these two 

20 oligonucleotides was then ligated with purified, Rsrll/EcrRI-digesizd pXyl-T5- 
Reporter (prepared as described in Example 5). The pXyl-T5-Reporter vector was 
then introduced into S. aureus and the cells were cultured in the presence and absence 
of 2.0% xylose and expression levels were measured. Xyl-T5-DD had only about 
10% of the activity of unmodified Xyl-T5. The nucleotide sequence of pXyi-T5-DD 

25 PI5a, a Xyl-T5-DD based expression vector, is provided in (SEQ ID NO.: 53). 

It will be appreciated that modified fusion promoters, such as those described 
above, can be used to construct a regulatable fusion promoter suite which provides a 
wide dynamic range of both basal and maximal transcriptional activities. Modifying 
regulatable fusion promoters so as to decrease their activity expands the range of both 

30 basal and maximal transcriptional activities of a suite of regulatable fusion promoters. 
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Each modified fusion promoter within this suite can be tightly and finely regulated 
over its own characteristic range of transcriptional activity. Furthermore, the modified 
regulatable fusion promoters within the suite can comprise different promoters 
operably linked to the same operators thus providing for a common means of 
5 regulation of the promoters within the suite. In some embodiments, the modified 
promoters within the set of regulatable fusion promoters within the suite can be linked 
to at least one operator contained in a promoterless expression vector that is capable of 
replicating in one or more Gram-positive organisms. Accordingly, a regulatable 
fusion promoter having an appropriate basal level and/or maximal level of 
10 transcriptional activity can be selected from the suite for use in the appropriate 
application. 

It will be appreciated that the methods described herein can be used to create 
promoter sequence modifications that either enhance or reduce promoter activity. 
These modifications can be made either alone or in combination. Among these 

15 changes are modifications of nucleotides within the promoter's -35 and -10 Box 
elements which make the promoter more or less like the consensus -35 Box and -10 
Box sequences (TTGACA and TATAAT, respectively). Other modifications include 
altering the length of the region between the operator or transcription start site and the 
-10 Box, altering the length and/or composition of the region between the -10 Box 

20 and the -35 Box, and altering the length and/or composition of the AT rich region 
upstream of the -35 Box (i.e. the -45 Box). It will also be appreciated that 
modifications which affect promoter activity can include addition of elements such as 
the CRE region for catabolite repression. These additional elements can be further 
modified so that they are more or less like their corresponding consensus sequence. 

25 Promoters which incorporate one or more modifications similar to those described 
above are represented by SEQ ID NOs.: 37, 41, 43 and 45. 

The experiments described in the Examples 4 -7 verify that the fusion 
promoters disclosed herein can be used to tightly and finely regulate transcription of 
an RNA or expression of a peptide or a protein throughout a range of activity levels. 
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It will be appreciated that both modified and unmodified fusion promoters, 
such as those described above, can be used to construct a regulatable fusion promoter 
suite which provides a wide dynamic range of both basal and maximal transcriptional 
activities. Modifying regulatable fusion promoters so as to increase or decrease their 
5 activity expands the range of both basal and maximal transcriptional activities of a 
suite of regulatable fusion promoters. The addition of unmodified fusion promoters 
having unique basal and/or maximal transcriptional activities also expands the 
dynamic range of the suite. Each regulatable fusion promoter within this suite can be 
tightly and finely regulated over its own characteristic range of transcriptional activity. 

10 Furthermore, the regulatable fusion promoters within the suite can comprise different 
promoters operably linked to the same operators thus providing for a common means 
of regulation of the promoters within the suite. In some embodiments, the promoters 
within the set of regulatable fusion promoters within the suite can be linked to at least 
one operator contained in a promoterless expression vector that is capable of 

15 replicating in one or more Gram-positive organisms. Accordingly, a regulatable 
fusion promoter having an appropriate basal level and/or maximal level of 
transcriptional activity can be selected from the suite for use in the appropriate 
application. Such a suite of promoters is useful in the methods of identifying genes 
required for proliferation or methods of identifying compounds which inhibit 

20 proliferation described herein. 

The examples below discusses methods that use fusion promoters to identify 
essential genes and compounds having potential antibiotic activity. 

EXAMPLE 8 

25 Identification of Genes Required for Cellular Proliferation by Expressing 

Proliferation- Inhibiting Antisense Nucleic Acids from a Fusion Promoter 
This example describes the examination of a library of candidate antisense 
nucleic acid operably linked to fusion promoters to identify essential genes in 
Staphylococcus aureus and Enterococcus faecalis. Although the following examples 
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utilize the fusion promoters Xyl-T5, CP25 and P59, it will be appreciated that any of 
the fusion promoters described herein may be used. 

It will be appreciated that a regulatable fusion promoter suite which provides a 
wide dynamic range of both basal and maximal transcriptional activities can be used 
5 to facilitate the discovery of proliferation-required genes. Each fusion promoter 
within this suite can be tightly and finely regulated over its own characteristic range of 
transcriptional activity. Furthermore, the regulatable fusion promoters within the suite 
can comprise different promoters operably linked to the same operators thus providing 
for a common means of regulation of the promoters within the suite. In some 

10 embodiments, the promoters within the set of regulatable fusion promoters within the 
suite can be linked to at least one operator contained in a promoterless expression 
vector that is capable of replicating in one or more Gram-positive organisms. 
Accordingly, a regulatable fusion promoter having an appropriate basal level and/or 
maximal level of transcriptional activity can be selected from the suite based on the 

15 expression level of the proliferation-required genes that are targeted for discovery. 

In the methods for identifying genes encoding gene products required for 
cellular proliferation using antisense nucleic acids expressed from an inducible fusion 
promoter, random genomic fragments are obtained from the organism in which it is 
desired to identify genes required for cellular proliferation. The random genomic 

20 fragments may be generated by a partial digestion with a restriction enzyme, 
mechanical shearing, using techniques such as sonication and nebulization, or DNAsel 
digestion. The random genomic fragments are operably linked to a regulatable fusion 
promoter in a vector, such as one of the expression vectors described herein. In those 
instances where the inserted genomic fragments are in an antisense orientation with 

25 respect to the promoter, the transcript produced is complementary to at least a portion of 
an mRNA encoding a gene product such that they interact with sense mRNA produced 
from various genes and thereby decrease the translation efficiency or the level of the 
sense messenger RNA (mRNA) thus decreasing production of the protein encoded by 
these sense mRNA molecules. In cases where the sense mRNA encodes a protein 

30 required for proliferation, cells grown under inducing conditions fail to grow or grow at 
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a substantially reduced rate. Additionally, in cases where the transcript produced is 
complementary to at least a portion of a non-translated RNA and where that non- 
translated RNA is required for proliferation, cells grown undo* inducing conditions also 
fail to grow or grow at a substantially reduced rate. In contrast, cells grown under non- 
5 inducing conditions grow at a normal rate. 

The genes to which the antisense nucleic acids are complementary are then 
identified and utilized in the methods of the present invention. Thus, to identify genes 
required for cellular proliferation, the extent of proliferation of cells containing the 
vectors in the presence of an inducer which induces transcription from the regulatable 

10 fusion promoter is compared to the extent of proliferation of cells in the absence of the 
inducer. Those cells which grow well in the absence of the inducer but exhibit 
significantly reduced proliferation in the presence of the inducer contain a vector 
encoding an antisense nucleic acid complementary to at least a portion of a gene 
required for cellular proliferation. 

15 Use of the above method to identify genes required for cellular proliferation in 

E. coli, Staphylococcus aureus, Salmonella typhimurium, Klebsiella pneumoniae, 
Pseudomonas aeruginosa and Enterococcus faecalis has been described in the 
following U.S. Patent Applications, the disclosures of which are incorporated herein 
by reference in their entireties: U.S. Patent Application Serial Number 09/815,242, 

20 filed March 21, 2001; U.S. Patent Application Serial Number 09/492,709, filed 
January 27, 2000; U.S. Patent Application Serial Number 09/71 1,164, filed November 
9, 2000; U.S. Patent Application Serial Number 09/741,669, filed December 19, 2000; 
U.S. Patent Application Serial Number 09/815,242, filed March 21, 2001; and U.S. 
Patent Application Serial Number 09/948,993, filed September 06, 2001. The 

25 methods used to identify genes required for cellular proliferation in Staphylococcus 
aureus, and Enterococcus faecalis are summarized below. 

Nucleic acids involved in proliferation of Staphylococcus aureus were 
identified using the EGI technique as follows. A shotgun library of Staphylococcus 
aureus genomic fragments was cloned into the vector pXyIT5-P15a, which harbors 

30 the Xyl-T5 inducible promoter. The vector was linearized at a unique BamHl site 



-105- 



WO 02/051982 



PCT/US01/50250 



immediately downstream of the XyIT5 promoter/operator. The linearized vector was 
treated with shrimp alkaline phosphatase to prevent reclosure of the linearized ends. 
Genomic DNA isolated from Staphylococcus aureus strain RN450 was fully digested 
with the restriction enzyme Sau3A or alternatively, partially digested with DNase I 
5 and "blunt-ended" by incubating with T4 DNA polymerase. Random genomic 
fragments between 200 and 800 base pairs in length were selected by gel purification. 
The size-selected genomic fragments were added to the linearized and 
dephosphorylated vector at a molar ratio of 0.1 to 1, and ligated to form a shotgun 
library. 

10 The ligated products were transformed into electrocompetent E. coli strain 

XLl-Blue MRF (Stratagene) and plated on LB medium with carbenicillin at 
100}ig/ml. Resulting colonies numbering 5 x 10 5 or greater were scraped and 
combined, and were then subjected to plasmid purification. 

The purified library was then transformed into electrocompetent 

15 Staphylococcus aureus strain RN4220. Resulting trans formants were plated on LBG 
agar with chloramphenicol at 15 jig/ml in order to generate 100 to 150 platings at 500 
colonies per plating. The colonies were subjected to robotic picking and arrayed into 
wells of 384 well culture dishes. Each well contained lOOjil of LBG + CM15 liquid 
medium. Inoculated 384 well dishes were incubated 16 hours at 37°C, and each well 

20 was robotically gridded onto solid LBG + CM15 medium with or without 2% xylose. 
Gridded plates were incubated 16 hours at 37°C, and then manually scored for arrayed 
colonies that were growth-compromised in the presence of xylose. Such growth 
sensitivity occurred at a frequency of one in sixty arrayed colonies. 

Arrayed colonies that were growth-sensitive on medium containing 2% xylose, 

25 yet were able to grow on similar medium lacking xylose, were subjected to further 
growth sensitivity analysis as follows: Colonies from the plate lacking xylose were 
manually picked and inoculated into individual wells of a 96 well culture dish 
containing LBG + CM15, and were incubated for 16 hours at 37°C. These cultures 
were robotically diluted 1/100 into fresh medium and allowed to incubate for 4 hours 

30 at 37°C, after which they were subjected to serial dilutions in a 384 well array and 
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then gridded onto media with and without 2% xylose. After growth for 16 hours at 
37°C, the arrays of serially diluted spots that resulted were compared between the two 
media. Spots that grew similarly on both media were scored as a negative and 
corresponding colonies were no longer considered. Spots on xylose medium that 
5 failed to grow to the same serial dilution compared to those on the non-xylose plate 
were given a score based on the differential, i.e. should the spots on xylose only grow 
to a serial dilution of -4 while they were able to grow to -8 on the non-xylose plate, 
then the corresponding transformant colony received a score of "4" representing the 
log difference in growth observed. 

10 Nucleic acids involved in proliferation of E> faecalis were identified as 

follows. A shotgun library of E. faecalis genomic fragments was cloned into either 
pEPEF3 or pEPEFU, which contain the CP25 or P59 promoter, respectively, 
regulated by the xyl operator/represser. The vector was linearized at a unique Smal 
site immediately downstream of the promoter/operator. The linearized vector was 

1 5 treated with alkaline phosphatase to prevent reclosure of the linearized ends. Genomic 
DNA isolated from E. faecalis strain OG1RF was partially digested with DNase I and 
"blunt-ended" by incubating with T4 DNA polymerase. Random genomic fragments 
between 200 and 800 base pairs in length were selected by gel purification. The size- 
selected genomic fragments were added to the linearized and dephosphorylated vector 

20 at a molar ratio of 2 to 1, and ligated to form a shotgun library. 

The ligated products were transformed into electrocompetent E. coli strain 
TOP10 cells (Invitrogen) and plated on LB medium with erythromycin (Erm) at 150 
Hg/mL. Resulting colonies numbering 5 x 10 5 or greater were scraped and combined, 
and were then subjected to plasmid purification. 

25 The purified library was then transformed into electrocompetent E. faecalis 

strain OG1RF. Resulting transformants were plated on Todd-Hewitt (TH) agar with 
erythromycin at 10 \xg/mL in order to generate 100 to 150 platings at 500 colonies per 
plating. The colonies were subjected to robotic picking and arrayed into wells of 384 
well culture dishes. Each well contained 100 of THB + Erm 10 jig/mL. 

30 Inoculated 384 well dishes were incubated 16 hours at room temperature, and each 
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well was robotically gridded onto solid TH agar + Enn with or without 5% xylose. 
Gridded plates were incubated 16 hours at 37 °C, and then manually scored for 
arrayed colonies that were growth-compromised in the presence of xylose. 

Arrayed colonies that were growth-sensitive on medium containing 5% xylose, 
5 yet were able to grow on similar medium lacking xylose, were subjected to further 
growth sensitivity analysis. Colonies from the plate lacking xylose were manually 
picked and inoculated into individual wells of a 96 well culture dish containing THB + 
Enn 10, and were incubated for 16 hours at 30 °C. These cultures were robotically 
diluted 1/100 into fresh medium and allowed to incubate for 4 hours at 37 °C, after 

10 which they were subjected to serial dilution on plates containing 5% xylose or plates 
lacking xylose. After growth for 16 hours at 37 °C, the arrays of serially diluted spots 
that resulted were compared between the two media. Colonics that grew similarly on 
both media were scored as a negative and corresponding colonies were no longer 
considered. Colonies on xylose medium that failed to grow to the same serial dilution 

15 compared to those on the non-xylose plate were given a score based on the 
differential. For example, colonies on xylose medium that only grow to a serial 
dilution of -4 while they were able to grow to -8 on the non-xylose plate, then the 
corresponding transformant colony received a score of "4" representing the log 
difference in growth observed. 

20 Of the 1.4 million Enterococcus faecalis colonies screened using the CP25 

fusion promoter, about 1 of every 3750 colonies had reduced proliferation ability 
whereas of the 390,000 colonies screened using the P59 fusion promoter, about 1 of 
every 390 colonies had reduced proliferation ability. 

Thus, it will be appreciated that use of a regulatable fusion promoter from the 

25 suite of regulatable fusion promoters which has a higher level of transcriptional 
activity may increase the frequency at which proliferation-required genes are 
identified and may allow identification of proliferation-required genes which would 
not be identified using weaker promoters. Such a fusion promoter would be expected 
to produce a larger number of colonies having reduced proliferation ability per 
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colonies screened Furthermore, proliferation-required genes from new gene classes 
would be expected. 

Expression vectors were purified from Staphylococcus aureus or Enterococcus 
faecalis colonies identified as having reduced proliferation upon induction of the 
fusion promoter. The nucleic acids contained on the expression vectors under control 
of the fusion promoter were isolated for subsequent nucleotide sequence 
determination and further characterization. 

It will be appreciated that the random genomip fragments used in the above 
methods can be generated using restriction enzymes and endonucleases other than 
those specifically described or by any other procedure. In addition, random genomic 
fragments may be generated by mechanical shearing. Sonication and nebulization are 
two such techniques commonly used for mechanical shearing of DNA. 

The next example describes experiments that use the Rapid Replacement of 
Genomic Promoters technique to identify essential genes for use as targets for drug 
discovery. 

EXAMPLE 9 

Replacement of Genomic Promoters for Proliferation-Required Genes with Inducible 

Fusion Promoters 

20 Nucleic acids involved in proliferation of Staphylococcus aureus or 

Enterococcus faecalis are identified using the Rapid Replacement of Genomic 
Promoters technique described in U.S. Patent Application Serial Number 09/948993, 
filed September 6, 2001, the disclosure of which is incorporated herein by reference in 
its entirety, and U.S. Provisional Patent Application Serial Number 60/230347, filed 

25 September 6, 2000, the disclosure of which is incorporated herein by reference in its 
entirety. Initially, a target gene is selected. The gene may be a gene which is known 
or suspected to be essential for proliferation or a gene which has not been 
characterized with respect to its essentiality for proliferation. Identification of the 
essential gene can be accomplished by using the EGI technique or other methods 

30 including, but not limited to, genes shown to be essential in the literature or genes of ^ 
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unknown essentiality that are predicted to be essential by bioinformatics. 
Additionally, one can target essential operons containing multiple genes. 

Next, the native promoter of the target gene or operon is replaced with a 
regulatable promoter, such as the fusion promoters described herein using "ET 
5 cloning" technology. Briefly, this is accomplished by engineering the one of the 
fusion promoters described herein such that it is 3' of a selectable marker (e.g., a CAT 
gene). This entire cassette is amplified by PCR. This PCR product becomes the 
template for another round of PCR using primers with 80bp of homology to the target 
promoter and 20bp of homology to the fusion promoter-containing cassette. However, 

10 as little as 60bp homology to the target promoter can be used. The region of 
homology is chosen such that the cassette will replace the promoter of the target gene 
without effecting the endogenous Shine-Delgarno sequence. This construct is then 
transformed into competent cells containing various levels of inducer (e.g. xylose or 
analog thereof) to assure that the correct level of expression is achieved to allow 

15 survival. In some embodiments, the native promoter in a strain having an enhanced 
frequency of homologous recombination may be replaced with a regulatable promoter. 

The correct integration of the cassette can be confirmed by colony PCR as 
described in U.S. Patent Application Serial Number 09/948993, filed September 6, 
2001, and U.S. Provisional Patent Application entitled Rapid PCR Method for 

20 Determination of Whether a Gene is Essential, Serial Number 60/230347, filed 
September 6, 2000, attorney docket number ELITRA.022PR, the disclosures of which 
are incorporated herein by reference in their entireties. Preferably, correct integrants 
are tested for growth defects as inducer is titrated away. The inability to grow in the 
absence of inducer will confirm that the target was essential. These strains can then 

25 be induced to a level determined to be the optimal balance between slowed growth 
and sensitivity to inhibition of the target protein for use in a cell-based assay for 
compounds with antibiotic activity or may be used to identify genes required for 
proliferation as described herein. 

The next example describes experiments that use an operator insertion technique 

30 to identify essential genes for use as targets for drug discovery. 
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EXAMPLE 10 

Regulation of Genomic Promoters for Proliferation-Required Genes bv Operator 

Insertion 

5 In this example, insertion of an operator into the promoter of a proliferation- 

required gene is described. A target promoter which drives the expression of a target 
gene or operon is identified. The target gene may be a gene which is known to be 
required for proliferation, suspected to be required for proliferation, or a gene which 
has not been characterized with respect to whether it is required for proliferation. An 

10 oligonucleotide comprising a xyl, tet, trp, mal, 7x1 or lac operator flanked on each 
side by 40 nucleotides homologous to the target promoter is synthesized. The 40 
nucleotide flanking sequences are determined based on the desired location for 
intergration of the xyl operator into the promoter. The single stranded oligonucleotide 
construct is then transformed into a bacterium having an enhanced frequency of 

15 homologous recombination. For example, the bacterium may express the X Beta and 
Gam proteins. The cells in the transformation mixture are diluted and plated on 
medium containing xylose. Colonies in which the operator has integrated into the 
target promoter are identified by colony PCR. The identified colonies are grown in 
medium containing or lacking inducer. The colonies proliferate on medium 

20 containing inducer but fail to grow on medium lacking inducer, thereby indicating the 
target promoter directs transcription of a gene encoding a gene product required for 
proliferation. Strains in which a gene encoding a gene product required for 
proliferation is under the control of the xyl, tet, trp, mal, Zcl or lac operator may also 
be used to identify compounds which inhibit proliferation in the cell-based asays 

25 described herein. 

It will be appreciated that operators other than the xyl, tet, trp, mal, Xcl or lac 
operator can be used with the methods described above. 

The next example describes experiments that use the Random Insertional Cis- 
Antisense technique to identify essential genes for use as targets for drug discovery. 

30 
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EXAMPLE 11 

Insertional Inactivation of Proliferation-Required Genes by Ran dom Insertion of an 

Inducible Fusion Promoter 
Nucleic acids involved in proliferation of Staphylococcus or Enterococcus are 

5 identified using the Random Insertional Cis-Antisense technique described in U.S. 
Provisional Patent Application Serial Number 60/230,403, filed September 6, 2000, 
the disclosure of which is incorporated herein by reference in its entirety. In one 
embodiment, a vector containing an outward-directed inducible fusion promoter is 
randomly integrated into the genome of a Gram-positive organism such as S. aureus or 

10 E. faecalis or the other Gram-positive organisms described herein and clones 
containing promoters in cis-antisense orientation with regard to essential genes are 
identified. Accordingly, a vector containing an inducible promoter, such as one of the 
fusion promoters described herein, is first engineered such that the inducible promoter 
reads into a multiple cloning site (MCS). The genetic marker is juxtaposed directly 3' 

15 of the MCS. This positioning helps ensure that the transforming vector will not be 
degraded by exonucleases (transformants in which the vector is degraded will not be 
recovered because the genetic marker will be lost). The vector is linearized by cutting 
with any single restriction enzyme in the MCS. Using different enzymes to linearize 
may increase the randomness of integration of the vector. The linearized vector is 

20 then transformed into the appropriate bacterial strain. 

The intergrative transformation frequency of linear DNA varies widely among 
bacterial strains. In some strains, linear vectors usually transform integratively at a 
significantly higher frequency than circular vectors. This is especially true in bacterial 
strains that have been modified to have an enhanced frequency of homologous 

25 recombination. In strains in which the frequency of integrative transformation of 
linear vectors is low, circular vectors often transform integratively at a higher 
frequency. The mechanism of integration is largely unknown but it is contemplated 
that integration occurs via nonhomologous recombination. Sites of integration appear 
to be entirely random. Transformants are robotically picked, grown, and are 

30 robotically gridded onto inducing and noninducing (inducing or not inducing 
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transcription of the promoter) media. These gridded plates are then screened for 
transfonnants that have a pronounced growth defect on inducing media relative to the 
phenotype on noninducing media Such transformants may have the vector integrated 
into their genome in several possible locations in an essential gene: the 3* untranslated 

5 region or any part of the coding or noncoding region that does not result in disruption 
of function of the gene product In most cases the inducible promoter is oriented such 
that it produces an antisense RNA from the target gene. Thus, when the promoter is 
induced the expression of the essential gene is reduced. Such inhibition can occur by 
one of several mechanisms including binding of the antisense RNA to the mRNA, 

10 colliding RNA polymerases, RNA interference (RNAi) or a combination of such 
phenomenon. 

The identity of the essential gene is then determined by sequencing the 
junction of the vector with the essential gene, the junction where the inducible 
promoter is juxtaposed with genomic DNA. This assay can also be used to create 
15 bacterial strains for cell-based assays that evaluate candidate antibiotic compounds as 
described herein. 

The next example describes approaches that were used to determine the 
identity of the newly discovered essential genes. 

20 EXAMPLE 12 

Nucleotide Sequence Determination of Genes that are Affected by Proliferation- 
Inhibiting Antisense Nucleic Acids Expressed from Fusion Promoters 
With reference to experiments that were performed using the antisense 
technique, this example describes approaches that were used to determine the nucleic 
25 acid sequence of newly discovered essential genes. It should be understood that the 
teachings herein can also be used to determine the nucleic acid sequence of essential 
genes identified using the promoter replacement, operator insertion or Random 
Insertional Cis- Antisense technique. 

Plasmids from transformant colonies that received a dilution plating score of 
30 "2" or greater were isolated to obtain the genomic DNA insert responsible for growth 
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inhibition as follows. Staphylococcus aureus were grown in standard laboratory 
media (LB or TB with 15 ug/ml Chloramphenicol to select for the plasmid). Growth 
was carried out at 37°C overnight in culture tubes or 2 ml deep well microtiter plates. 
Lysis was performed as follows. Cultures (2-5 ml) were centrifuged and the cell 

5 pellets resuspended in 1 .5 mg/ml solution of lysostaphin (20 fil/ml of original culture) 
followed by addition of 250 \il of resuspension buffer (Qiagen). Alternatively, cell 
pellets were resuspended directly in 250 ^1 of resuspension buffer (Qiagen) to which 
5-20 nl of a 1 mg/ml lysostaphin solution were added. DNA was isolated using 
Qiagen miniprep kits or Wizard (Qiagen) miniprep kits according to the instructions 

1 0 provided by the manufacturer. 

The genomic DNA inserts were then amplified from the purified plasmids by 
PCR as follows. Approximately, 1 :1 of Qiagen purified plasmid was put into a total 
reaction volume of 25 :1 Qiagen Hot Start PCR mix. The following primers were used 
in the PCR reaction: 

15 

P XylT5F: CAGCAGTCTGAGTTATAAAATAG (SEQ ID NO: 24) 
LexL TGTTTTATCAGACCGCTT (SEQ ID NO: 25) 

PCR was carried out in a PE GenAmp with the following cycle times: 
Step 1. 95 °C 15min 
Step 2. 94 °C 45 sec 
Step 3. 54 °C 45 sec 
Step 4. 72 °C 1 minute 
Step 5. Return to step 2 (29 times) 
Step 6. 72 °C 10 minutes 
Step 7. 4°Chold 

The PCR products were cleaned using Qiagen Qiaquick PCR plates according 
to the manufacturer's instructions. The amplified genomic DNA inserts were then 
30 subjected to automated sequencing. 



20 



25 
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For E. faecalis, piasmids from transformant colonies that received a dilution 
plating score of "2" or greater were isolated to obtain the genomic DNA insert 
responsible for growth inhibition as follows. E. faecalis were grown in THB 10 fig/ml 
Erm at 30°C overnight in 100 ul culture wells in microtiter plates. To amplify insert 
5 DNA 2 ul of culture were placed into 25 jjJ Qiagen Hot Start PCR mix. PCR 
reactions were in 96 well microtiter plates. Primers to vector regions flanking the 
insert were used in the PCR reaction. PCR was carried out in a PE GenAmp with the 
following cycle times: 

10 Step 1. 95° C 15min 

Step 2. 94° C 45 sec 

Step 3. 54° C 45 sec 

Step 4. 72° C 1 minute 

Step 5. Return to step 2, 29 times 
15 Step 6. 72° C 10 minutes 

Step 7. 4° C hold 

The PCR products were cleaned using Qiagen Qiaquick PCR plates according 
to the manufacturer's instructions. The purified PCR products were then directly 
20 cycle sequenced with Qiagen Hot Start PCR mix. PCR was carried out in a PE 
GenAmp with the following cycle times: 

Step 1. 94° C 15 min 
Step 2. 96° C 10 sec 
25 Step 3. 50° C 5 sec 
Step 4. 60° C 4 min 
Step 5. Return to step 2, 24 times 
Step 6. 4° C hold 
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The PCR products were cleaned using Qiagen Qiaquick PCR plates according 
to the manufacturer's instructions. The amplified genomic DNA inserts were then 
subjected to automated sequencing. 

The example below describes techniques that were used to compare the 
5 sequences identified by the approaches above to sequences present in publicly available 
databases so as to better characterize the essential genes. 

EXAMPLE 13 

Com parison Of Isolated Nucleic Acids to Known Sequences 

10 With reference to experiments that were performed using the antisense 

technique, this example describes approaches that were used to characterize the nucleic 
acid sequence of newly discovered essential genes in S. aureus and E.faecalis. It should 
be understood that the teachings herein can also be used to determine the nucleic acid 
sequence of essential genes in other Gram-positive cells and essential genes identified in 

15 Gram-positive cells using the promoter replacement, operator insertion or Random 
Insertional Cis- Antisense technique. 

The nucleic acid sequences of the subcloned fragments from S. aureus or £ 
faecalis obtained from the expression vectors containing the Xyl-T5 and CP25 fusion 
promoters discussed above were compared to known sequences from S. aureus or £. 

20 faecalis and other microorganisms as follows. First, to confirm that each clone 
originated from one location on the chromosome and was not chimeric, the sequences 
of the selected clones were compared against the S. aureus or E. faecalis genomic 
sequences to align the clone to the correct position on the chromosome. The NCBI 
BLASTN v 2.0.9 program was used for this comparison, and the incomplete 

25 Staphylococcus aureus genomic sequences licensed from TIGR, as well as the NCBI 
nonredundant GenBank database were used as the source of genomic data. The E. 
faecalis sequences were compared to a proprietary database. 

The BLASTN analysis was performed using the default parameters except that 
the filtering was turned off. No further analysis was performed on inserts which 

30 resulted from the ligation of multiple fragments. 
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In general, antisense molecules and their complementary genes are identified 
as follows. First, all possible full length open reading frames (ORFs) are extracted 
from available genomic databases. Such databases include the GenBank 
nonredundant (nr) database, the unfinished genome database available from TIGR and 
5 the PathoSeq database developed by Incyte Genomics. The latter database comprises 
over 40 annotated bacterial genomes including complete ORF analysis. If databases 
are incomplete with regard to the bacterial genome of interest, it is not necessary to 
extract all ORFs in the genome but only to extract the ORFs within the portions of the 
available genomic sequences which are complementary to the clones of interest. 
10 Computer algorithms for identifying ORFs, such as GeneMark, are available and well 
known to those in the art. Comparison of the clone DNA to the complementary 
ORF(s) allows determination of whether the clone is a sense or antisense clone. 
Furthermore, each ORF extracted from the database can be compared to sequences in 
well annotated databases including the GenBank (nr) protein database, SWISSPROT 
1 5 and the like. A description of the gene or of a closely related gene in a closely related 
microorganism is often available in these databases. Similar methods are used to 
identify antisense clones corresponding to genes encoding non- translated RNAs. 

In some cases, for example, the NCBI BLASTN 2.0.9 computer algorithm was 
used and the default parameters were used with the exception that filtering was turned 
20 off. The default parameters for the BLASTN and BLASTX analyses were: 

Expectation value (e)=10 

Alignment view options: pairwise 

Filter query sequence (DUST with BLASTN, SEG with others)=T 
Cost to open a gap (zero invokes behavior)=0 
25 Cost to extend a gap (zero invokes behavior)=0 

X dropoff value for gapped alignment (in bits) (zero invokes behavior)=0 
Show GFs in deflines=F 

Penalty for a nucleotide mismatch (BLASTN only)=!3 

Reward for a nucleotide match (BLASTN only)=l 
30 Number of one-line descriptions (V)=500 

Number of alignments to show (B)=250 

Threshold for extending hits=default 

Perform gapped alignment (not available with BLASTX)=T 

Query Genetic code to use=l 
35 DB Genetic code (for TBLAST[nx] only=l 
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Number of processors to use=l 
Seq Align file 

Believe the query defline=F 
Matrix=BLOSUM62 
5 Word Size= default 

Effective length of the database (use zero for the real size)=0 
Number of best hits from a region to keep=100 
Length of region used to judge hits=20 

Effective length of the search space (use zero for the real size)=0 
1 0 Query strands to search against database (for BLAST[nx] and TBLASTX), 3 is 

both, 1 is top, 2 is bottom=3 

Produce HTML outpuHF 

Those ORFs found to be complementary to the cloned nucleic acid sequences 
discussed above were evaluated with regard to orientation. Antisense clones were 
15 identified as those clones for which transcription from the inducible promoter would 
result in the expression of an RNA antisense to the complementary ORR 

The gene descriptions in the PathoSeq database derive from annotations 
available in the public sequence databases described above. It will be appreciated that 
ORFs may also be identified using databases other than PathoSeq. For example, the 
20 ORFs may be identified using the methods described in U.S. Provisional Patent 
Application Serial Number 60/191,078, filed March 21, 2000, the disclosure of which 
is incorporated herein by reference in its entirety. 

In the antisense-based screening experiments in Enterococcus faecalis as 
described in Examples 8 and 9, 256 proliferation-inhibiting antisense sequences 
25 corresponding to 61 unique ORFs were discovered using the CP25 fusion promoter 
whereas 790 proliferation-inhibiting antisense sequences corresponding to 150 unique 
ORFs were discovered using the P59 fusion promoter. 

The example below describes approaches that may be used to identify operons or 
downstream genes associated with the essential genes identified by the methods above. 

30 

EXAMPLE 14 

Identification of Genes and their Corresponding Operons th at are Affected by 
Proliferation-Inhibiting Antisense Nucleic Acids Expressed from Fusio n Promoters 
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With reference to experiments that were performed using the antisense 
technique, this example describes approaches that may be used to identify operons or 
downstream genes associated with the newly discovered essential genes in & aureus and 
Rfaecalis. It should be understood that the teachings herein can also be used to identify 

5 operons or downstream genes associated with the essential genes in other Gram-positive 
cells and essential genes identified in Gram-positive cells using the promoter 
replacement, operator insertion or Random Insertional Cis- Antisense technique 

Once the genes involved in S. aureus and £. faecalis proliferation are identified 
as described above, the operons in which these genes lie may be identified by 

10 comparison with known microbial genomes. Since bacterial genes are expressed in a 
polycistronic manner, the antisense inhibition of a single gene in an operon might affect 
the expression of all the other genes on the operon or the genes downstream from the 
single gene identified. Accordingly, each of the genes contained within an operon may 
be analyzed for their effect on proliferation. 

15 Operons are predicted by looking for all adjacent genes in a genomic region 

that lie in the same orientation with no large noncoding gaps in between. First, full 
length ORFs complementary to the antisense molecules are identified as described 
above. Adjacent ORFs are then identified and their relative orientation determined 
either by directly analyzing the genomic sequences surrounding the ORFs 

20 complementary to the antisense clones or by extracting adjacent ORFs from the 
collection obtained through whole genome ORF analysis described above followed by 
ORF alignment. Operons predicted in this way may be confirmed by comparison to 
the arrangement of the homologous genes in the Bacillus subtilis complete genome 
sequence, as reported by the genome database compiled at Institut Pasteur Subtilist 

25 Release R15.1 (June 24, 1999) which can be found at 
http://bioweb.pasteur.fr/GenoList/SubtiList/. The Bacillus subtilis genome is the only 
fully sequenced and annotated genome from a Gram-positive microorganism, and 
appears to have a high level of similarity to Staphylococcus aureus both at the level 
of conservation of gene sequence and genomic organization including operon 

30 structure. Annotation of some of the DNA sequences in some of the aforementioned 
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databases is lacking, but comparisons may be made to E. coli using tools such as 
BLASTX. Public or proprietary databases may be used to analyzed E. faecalis 
sequences as well as the databases listed above. 

Once the full length ORFs and/or the operons containing them have been 

5 identified using the methods described above, they can be obtained from a genomic 
library by performing a PCR amplification using primers at each end of the desired 
sequence. Those skilled in the art will appreciate that a comparison of the ORFs to 
homologous sequences in other microorganisms will facilitate confirmation of the 
start and stop codons at the ends of the ORFs. 

10 In some embodiments, the primers may contain restriction sites which 

facilitate the insertion of the gene or operon into a desired vector. For example, the 
gene may be inserted into an expression vector and used to express the proliferation- 
required protein as described below. Other methods for obtaining the full length 
ORFs and/or operons are familiar to those skilled in the art. For example, natural 

15 restriction sites may be employed to insert the full length ORFs and/or operons into a 
desired vector. 

The next example describes approaches that may be used to identify individual 
genes within an operon that was characterized as being required for proliferation. 

20 EXAMPLE 15 

Identification of Individual Genes within an Operon Required for Proliferation 
With reference to experiments that were performed using the antisense 
technique, this example describes approaches that may be used to identify individual 
genes within an operon that has been characterized as being required for proliferation 

25 of a Gram-positive organism such as S. aureus or E. faecalis or the other Gram-positive 
organisms listed herein. It should be understood that the teachings herein can also be 
used to identify individual genes within an operon that were characterized as being 
required for proliferation in other Gram-positive cells and can be used in conjunction 
with the promoter replacement, operator insertion or Random Insertional Cis- Antisense 

30 technique. 
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The strategy described in this example seeks to determine if a targeted gene 
within an operon is required for cell proliferation by replacing the targeted gene in the 
chromosome with an in-frame deletion of the coding region of the targeted gene. 
Deletion inactivation of a chromosomal copy of a gene in Staphylococcus aureus or 

5 Enterococcus faecalis can be accomplished by integrative gene replacement The 
principles of this method were described in Xia, M., et al. 1999 Plasmid 42:144-149 
and Hamilton, C. M. f et al 1989. J, Bacteriol 171: 4617-4622, the disclosures of 
which are incorporated herein by reference in their entireties. In this approach, a 
mutant allele of the targeted gene is constructed by way of an in-frame deletion and 

10 introduced into the chromosome using a suicide vector. This results in a tandem 
duplication comprising a deleted (null) allele and a wild type allele of the target gene. 
Cells in which the vector sequences have been deleted are isolated using a counter- 
selection technique. Removal of the vector sequence from the chromosomal insertion 
results in either restoration of the wild-type target sequence or replacement of the wild 

15 type sequence with the deletion (null) allele. E. faecalis genes can be disrupted using 
a suicide vector that contains an internal fragment to a gene of interest. With the 
appropriate selection this plasmid will homologously recombine into the chromosome 
(Nallapareddy, S. R., X. Qin, G. M. Weinstock, M. Hook, B. E. Murray. 2000. Infect. 
Immun. 68:5218-5224, the disclosure of which is incorporated herein by reference). 

20 The resultant population of Staphylococcus aureus, Enterococcus faecalis, or 

other Gram-positive colonies can then be evaluated to determine whether the target 
sequence is required for proliferation by PCR amplification of the affected target 
sequence. If the targeted gene is not required for proliferation, then PCR analysis will 
show that roughly equal numbers of colonies have retained either the wild-type or the 

25 mutant allele. If the targeted gene is required for proliferation, then only wild-type 
alleles will be recovered in the PCR analysis. 

The method of cross-over PCR is used to generate the mutant allele by 
amplification of sequences flanking but not including the coding region of the gene of 
interest, such that overlap between the resulting two PCR amplification products 

30 allows them to hybridize. Further PCR amplification of this hybridization product 
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using primers representing the extreme 5* and 3* ends can produce an amplification 
product containing an in-frame deletion of the coding region but retaining substantial 
flanking sequences. 

For Staphylococcus aureus, this amplification product is subcloned into the 

5 suicide vector pSA3182 (Xia, M., et al 1999 Plasmid 42:144-149, the disclosure of 
which is incorporated herein by reference in its entirety) which is host-dependent for 
autonomous replication. This vector includes a tetC tetracycline-resistance marker 
and the origin of replication of the well-known Staphylococcus aureus plasmid pT181 
(Mojumdar, M and Kahn, S.A., Characterisation of the Tetracycline Resistance Gene 

10 of Plasmid pT18t, J. Bacteriol 170: 5522 (1988), the disclosure of which is 
incorporated herein by reference in its entirety). The vector lacks the repC gene which 
is required for autonomous replication of the vector at the pTISI origin. However 
autonomous replication of this vector can occur in a Staphylococcus aureus host strain 
such as SA3528, which expresses repC in trans. Once the amplified truncated target 

15 gene sequence is cloned and propagated in the pSA3182 vector, it can then be 
introduced into a repC minus strain such as RN4220 (Kreiswirth, B.N. et al, The 
Toxic Shock Syndrome Exotoxin Structural Gene is Not Detectably Transmitted by a 
Prophage, Nature 305:709-712 (1983), the disclosure of which is incorporated herein 
by reference in its entirety) by electroporation with selection for tetracycline 

20 resistance. In this strain, the vector must integrate by homologous recombination at 
the targeted gene in the chromosome to impart drug resistance. This results in a 
inserted truncated copy of the allele, followed by pSA3182 vector sequence, and 
finally an intact and functional allele of the targeted gene. 

Once a tetR Staphylococcus aureus strain is isolated from the above technique 

25 and shown to include truncated and wild-type alleles of the targeted gene as described 
above, a second plasmid, pSA7592 (Xia, M., et al 1999 Plasmid 42:144-149, the 
disclosure of which is incorporated herein by reference in its entirety) is introduced 
into the strain by electroporation. This gene includes an erythromycin resistance gene 
and a repC gene that is expressed at high levels. Expression of repC in these 

30 transformants is toxic due to interference of normal chromosomal replication at the 
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integrated pT181 origin of replication. This counterselects for strains that have 
removed the vector sequence by homologous recombination, resulting in either of two 
outcomes: The counterselected cells either possess a wild-type allele of the targeted 
gene or a gene in which the wild-type allele has been replaced by the engineered in- 
5 frame deletion of the truncated allele. 

PCR amplification can be used to test either outcome among the resulting 
erythromycin resistant, tet sensitive transformant colonies. If the targeted gene is not 
required for cellular replication, then PCR evidence for both wild-type and mutant 
alleles will be found among the population of resultant transformants. However, if 
10 the targeted gene is required for cellular replication, then only the wild-type form of 
the gene will be evident among the resulting transformants. 

Similarly, for Enterococcus faecalis the PCR products containing the mutant 
allele of the target sequence may be introduced into an appropriate knockout vector 
and cells in which the wild type target has been disrupted are selected using the 
1 5 appropriate methodology. 

The above methods have the advantage that insertion of an in- frame deletion 
mutation is far less likely to cause downstream polar effects on genes in the same 
operon as the targeted gene. However, it will be appreciated that other methods for 
disrupting Staphylococcus aureus or Enterococcus faecalis, which are familiar to 
20 those skilled in the art may also be used. Each gene in the operon may be disrupted 
using the methodology above to determine whether it is required for proliferation. 

The example below describes approaches to express genes involved in cell 
proliferation. 

25 EXAMPLE 16 

Expression of the Proteins Encoded bv Genes Identified as 
Required for Staphylococcus aureus or Enterococcus faecalis Proliferation 
The following is provided as one exemplary method to express the proliferation- 
required proteins encoded by sequences identified by the methods described herein using 
30 expression systems designed either for E. coli f Staphylococcus aureus, or 
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Enterococcus faecalis. First, the initiation and termination codons for the gene are 
identified. If desired, methods for improving translation or expression of the protein are 
well known in the art. For example, if the nucleic acid encoding the polypeptide to be 
expressed lacks a methionine codon to serve as the initiation site, a strong Shine- 

5 Dalgamo sequence, or a stop codon, these sequences can be added. Similarly, if the 
identified nucleic acid sequence lacks a transcription termination signal, this sequence 
can be added to the construct by, for example, splicing out such a sequence from an 
appropriate donor sequence. In addition, the coding sequence may be operably linked to 
a strong promoter or an inducible promoter, such as the fusion promoters described 

10 herein, if desired. The identified nucleic acid sequence or portion thereof encoding the 
polypeptide to be expressed is obtained by, for example, PCR from the bacterial 
expression vector or genome using oligonucleotide primers complementary to the 
identified nucleic acid sequence or portion thereof and containing restriction 
endonuclease sequences appropriate for inserting the coding sequences into the vector 

15 such that the coding sequences can be expressed from the vector's promoter. 
Alternatively, other conventional cloning techniques may be used to place the coding 
sequence under the control of the promoter. In some embodiments, a termination signal 
may be located downstream of the coding sequence such that transcription of the coding 
sequence ends at an appropriate position. 

20 Several expression vector systems for protein expression in E. coli are well 

known and available to someone knowledgeable in the art. The coding sequence may 
be inserted into any of these vectors and placed under the control of the promoter. 
The expression vector may then be transformed into DH5a or some other E. coli strain 
suitable for the over expression of proteins. 

25 Alternatively, an expression vector encoding a protein required for proliferation 

of Staphylococcus aureus or Enterococcus faecalis may be introduced into 
Staphylococcus aureus or Enterococcus faecalis. In some embodiments, gene 
encoding the protein required for proliferation may be operably linked to one of the 
fusion promoters described herein. Electroporation protocols for introducing nucleic 

30 acids into Staphylococcus aureus and Enterococcus faecalis are well known in the art 
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For example, the electroporation protocol described in Staphylococcus aureus 
transformation protocols described in J.CLee "Electroporation of Staphylococci" from 
Methods in Molecular Biology vol 47: Electroporation Protocols for Microorganisms 
Edited by : J.A. Nickoloff Humana Press Inc., Totowa, NJ. pp209-216, the disclosure 

5 of which is incorporated herein by reference in its entirety, may be used. Positive 
transformants are selected after growing the transformed cells on plates containing an 
antibiotic to which the vector confers resistance. For example, Staphylococcus aureus 
may be transformed with an expression vector in which the coding sequence is operably 
linked to the fusion promoters described herein, such as fusion promoters comprising the 

10 T5 promoter operably linked to a xyl operator such that expression of the encoded 
protein is inducible with xylose. Such an example vector is pXyl-T5 pi 5a. In another 
example, Enterococcus faecalis may be transformed with an expression vector in 
which the coding sequence is operably linked to the fusion promoters described herein, 
such as fusion promoters comprising the PL promoter operably linked to a xyl operator 

15 such that expression of the encoded protein is inducible with xylose. Such an example 
vector is pEPEF21. 

In one embodiment, the protein is expressed and maintained in the cytoplasm 
as the native sequence. In an alternate embodiment, the expressed protein can be 
modified to include a protein tag that allows for differential cellular targeting, such as 

20 to the periplasmic space of Gram-negative or Gram-positive expression hosts or to the 
exterior of the cell (i.e., into the culture medium). In some embodiments, the osmotic 
shock cell lysis method described in Chapter 16 of Current Protocols in Molecular 
Biology, Vol. 2, (Ausubel, et aL 9 Eds.) John Wiley & Sons, Inc. (1997) may be used to 
liberate the polypeptide from the cell. In still another embodiment, such a protein tag 

25 could also facilitate purification of the protein from either fractionated cells or from 
the culture medium by affinity chromatography. Each of these procedures can be used 
to express a proliferation-required protein. 

Expressed proteins, whether in the culture medium or liberated from the 
periplasmic space or the cytoplasm, are then purified or enriched from the supernatant 

30 using conventional techniques such as ammonium sulfate precipitation, standard 
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chromatography, immunoprecipitation, immunochromatography, size exclusion 
chromatography, ion exchange chromatography, and HPLC. Alternatively, the 
polypeptide may be secreted from the host cell in a sufficiently enriched or pure state in 
the supernatant or growth media of the host cell to permit it to be used for its intended 

5 purpose without further enrichment. The purity of the protein product obtained can be 
assessed using techniques such as SDS PAGE, which is a protein resolving technique 
well known to those skilled in the art. Coomassie, silver staining or staining with an 
antibody are typical methods used to visualize the protein of interest. 

Antibodies capable of specifically recognizing the protein of interest can be 

10 generated using synthetic peptides using methods well known in the art. See, 
Antibodies: A Laboratory Manual, (Harlow and Lane, Eds.) Cold Spring Harbor 
Laboratory (1988). For example, 15-rner peptides having a sequence encoded by the 
appropriate identified gene sequence of interest or portion thereof can be chemically 
synthesized. The synthetic peptides are injected into mice to generate antibodies to the 

15 polypeptide encoded by the identified nucleic acid sequence of interest or portion 
thereof. Alternatively, samples of the protein expressed from the expression vectors 
discussed above can be purified and subjected to amino acid sequencing analysis to 
confirm the identity of the recombinantly expressed protein and subsequently used to 
raise antibodies. 

20 The protein encoded by the identified nucleic acid sequence of interest or portion 

thereof can be purified using standard immunochromatography techniques. In such 
procedures, a solution containing the desired protein, such as the culture medium or a 
cell extract, is applied to a column having antibodies against the secreted protein 
attached to the chromatography matrix. The desired protein is allowed to bind the 

25 immunochromatography column. Thereafter, the column is washed to remove non- 
specifically bound proteins. The specifically-bound secreted protein is then released 
from the column and recovered using standard techniques. These procedures are well 
known in the art. 

In an alternative protein purification scheme, the identified nucleic acid sequence 
30 of interest or portion thereof can be incorporated into expression vectors designed for 
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use in purification schemes employing chimeric polypeptides. In such strategies the 
coding sequence of die identified nucleic acid sequence of interest or portion thereof is 
inserted in-frame with the gene encoding die other half of the chimera. The other half of 
the chimera can be maltose binding protein (MB?) or a nickel binding polypeptide 

5 encoding sequence. A chromatography matrix having antibody to MBP or nickel 
attached thereto is then used to purify the chimeric protein. Protease cleavage sites can 
be engineered between the MBP gene or the nickel binding polypeptide and the 
identified expected gene of interest, or portion thereof. Thus, the two polypeptides of 
the chimera can be separated from one another by protease digestion. 

10 One useful expression vector for generating maltose binding protein fusion 

proteins is pMAL (New England Biolabs), which encodes the malE gene. In the pMal 
protein fusion system, the cloned gene is inserted into a pMal vector downstream from 
the malE gene. This results in the expression of an MBP-fusion protein. The fusion 
protein is purified by affinity chromatography. These techniques as described are well 

15 known to those skilled in the art of molecular biology. An Example describing in detail 
the generation of monoclonal and polyclonal antibodies appears below. 

EXAMPLE 17 

Production of an Antibody to an isolated Staphylococcus aureus or Enterococcus 
20 faecalis Protein 

Substantially pure protein or polypeptide is isolated from the transformed cells as 
described in Example 16. The concentration of protein in the final preparation is 
adjusted, for example, by concentration on a 10,000 molecular weight cut off AMICON 
filter device (Millipore, Bedford, MA), to the level of a few micrograms/ml. 
25 Monoclonal or polyclonal antibody to the protein can then be prepared as follows: 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as 
described can be prepared from murine hybridomas according to the classical method of 
Kohler, G. and Milstein, C, Nature 256:495 (1975) or any of the well-known derivative 
methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of 
30 the selected protein or peptides derived therefrom over a period of a few weeks. The 
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mouse is then sacrificed, and the antibody-producing cells of the spleen isolated The 
spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and 
the excess unfused cells destroyed by growth of the system on selective media 
comprising aminopterin (HAT media). The successfully-fused cells are diluted and 
5 aliquots of the dilution placed in wells of a microtiter plate where growth of the culture 
is continued. Antibody-producing clones are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures, such as EL1SA, as described 
by Engvall, E., Enzyme immunoassay ELISA and EMIT " MetK EnzymoL 70:419 
(1980), and derivative methods thereof. Selected positive clones can be expanded and 

10 their monoclonal antibody product harvested for use. Detailed procedures for 
monoclonal antibody production are described in Davis, L. et al Basic Methods in 
Molecular Biology Elsevier, New York. Section 21-2. 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single 
protein or a peptide can be prepared by immunizing suitable animals with the expressed 

15 protein or peptides derived therefrom described above, which can be unmodified or 
modified to enhance immunogenicity. Effective polyclonal antibody production is 
affected by many factors related both to the antigen and the host species. For example, 
small molecules tend to be less immunogenic than larger molecules and can require the 
use of carriers and adjuvant. Also, host animals vary in response to site of inoculations 

20 and dose, with both inadequate or excessive doses of antigen resulting in low titer 
antisera. Small doses (ng level) of antigen administered at multiple intradermal sites 
appears to be most reliable. An effective immunization protocol for rabbits can be 
found in Vaitukaitis, J. et al. J. Clin. Endocrinol Metab. 33:988-991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested 

25 when antibody titer thereof, as determined semi-quantitatively, for example, by double 
immunodiffusion in agar against known concentrations of the antigen, begins to fall. 
See, for example, Ouchterlony, O. et aL, Chzp. 19 in: Handbook of Experimental 
Immunology D. Wier (ed) Blackwell (1973). Plateau concentration of antibody is 
usually in the range of 0. 1 to 0.2 mg/ml of serum (about 12 :M). Affinity of the antisera 

30 for the antigen is determined by preparing competitive binding curves, as described, for 
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example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and 
Friedman, Eds.) Amer. Soc. For Microbiol., Washington, D.C. (1980). 

Antibody preparations prepared according to either protocol are useful in 
quantitative immunoassays which determine concentrations of antigen-bearing 
5 substances in biological samples; they are also used semi-quantitatively or qualitatively 
to identify the presence of antigen in a biological sample. The antibodies can also be 
used in therapeutic compositions for killing bacterial cells expressing the protein. The 
next section describes how to make and use an in vivo system for performing in vivo 
analysis on bacterial strains containing regulatable essential genes and for screening 
10 compounds that inhibit bacterial proliferation. 

The example below describes the an in vivo system for analyzing the growth 
characteristics of S. aureus and Enterococcus faecalis that contain regulatable 
essential genes. 

15 EXAMPLE 18 

In Vivo Models for Analyzing the Regulation of Essential Genes and Screening 

Therapeutic Compounds 
Whether a given gene is essential for a bacterium growing in nature (in vivo) or 
whether a gene is required for infection of a host organism can be determined by 

20 constructing a conditional allele of that gene which is suitable for animal testing. 
Expression of an antisense RNA molecule complementary to a candidate gene can, by 
decreasing synthesis of the candidate gene product, constitute a conditional allele that 
is suitable for animal testing. Candidate genes essential in vivo or required for 
virulence can be tested by preparing bacterial strains using the EGI, antisense, 

25 promoter replacement, operator insertion, and random promoter insertion techniques. 
Preferably, the constructs used to prepare these bacterial strains contain an expression 
vector under control of a fusion promoter described herein. Animals infected by these 
bacterial strains can be used to evaluate the ability to modulate bacterial proliferation 
in vivo by adding or decreasing the concentration of inducer and can be used to screen 

30 compounds in vivo to validate their therapeutic properties. 



-129- 



WO 02/051982 



PCT/US01/50250 



Stable transformation of Staphylococcus aureus or Enterococcus faecalis with 
constructs prepared as described herein can be accomplished by electroporation and 
selection for clones resistant to 15 ng/ml chloramphenicol in the case of 
Staphylococcus aureus of 20 ^ig/ml erythromycin in the case of Enterococcus faecalis. 

5 Preferably, whether each construct is essential is verified in vitro using induction in 
medium containing various amounts of inducer or an analog thereof, prior to 
introducing the cells into animals. 

For the thigh infection model, animals are rendered neutropenic chemically 
prior to the start of the experiment. By one approach, the inoculum will consist of 

10 10 5 to 10 6 cells of a normally virulent Gram-positive organism expressing antisense to 
an essential gene such as that for gyrB under control of a fusion promoter described 
herein. These cells are injected to one thigh of a suitable animal (e.g., mice). Most 
microorganisms attain a logarithmic growth 2 hours after inoculation. Treated 
animals receive therapy from 2 hours post inoculation up to 24 hours. A typical 

15 efficacy study consists of 2 control groups and 5 treatment groups, each being treated 
with different doses of inducer. With 15 animals per group, 105 mice are used per 
study. The control sets are given intraperitoneal injections of saline, which will not 
induce expression of the fusion promoter. The experimental animals are given the 
inducer (e.g. xylose, tetracycline, IPTG, maltose, absence of tryptophan, or 

20 temperature change) in intraperitoneal injections to induce expression of the antisense 
promoter. Alternatively, the promoter can be induced by intravenous infusion of 
inducer at sub-toxic levels. 

The end point used to follow the infection process is viable bacterial counts 
per thigh. The controls in which the antisense RNA is not induced will fail to impede 

25 growth of the Gram-positive organism and thus a logarithmic increase in viable 
bacteria will occur. The Gram-positive cells recovered from the site of infection 
should be viable until antisense expression is subsequently induced. This will 
demonstrate that the plasmid is still patent. However, in animals receiving the xylose 
injections, expression of the antisense RNA will occur, the essential gene or gene 

30 product will be compromised, and the Gram-positive cells infecting the mice will not 
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multiply. Accordingly, fewer viable cells will be recovered from the site of infection 
in the experimental animals. The Gram-positive cells from the induced mice will be 
recovered, if still present, and assayed as above to determine if the promoter and gene 
are still present and functional. 
5 The sections below describe the use of fusion promoters in methods to discover 

compounds that inhibit bacterial proliferation. 

EXAMPLE 19 

Screening Chemical Libraries in Cells Sensitized with Antisense RNA Transcribed from 

10 Fusion Promoters 

Having isolated and expressed bacterial proteins shown to be required for 
bacterial proliferation, the present invention further contemplates the use of these 
expressed target proteins in assays to screen libraries of compounds for potential drug 
candidates. The generation of chemical libraries is well known in the art. For example, 

1 5 combinatorial chemistry can be used to generate a library of compounds to be screened 
in the assays described herein. A combinatorial chemical library is a collection of 
diverse chemical compounds generated by either chemical synthesis or biological 
synthesis by combining a number of chemical building block" reagents. For example, a 
linear combinatorial chemical library such as a polypeptide library is formed by 

20 combining amino acids in every possible combination to yield peptides of a given 
length. Millions of chemical compounds theoretically can be synthesized through such 
combinatorial mixings of chemical building blocks. For example, one commentator 
observed that the systematic, combinatorial mixing of 100 interchangeable chemical 
building blocks results in the theoretical synthesis of 100 million tetrameric compounds 

25 or 10 billion pentameric compounds. (Gallop et aL, 1994, "Applications of 
Combinatorial Technologies to Drug Discovery, Background and Peptide Combinatorial 
Libraries" Journal of Medicinal Chemistry, 37: 1233-1250). Other chemical libraries 
known to those in the art may also be used, including natural product libraries. 

Once generated, combinatorial libraries can be screened for compounds that 

30 possess desirable biological properties. For example, compounds which may be useful 
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as drugs or to develop drugs would likely have the ability to bind to the target protein 
identified, expressed and purified as discussed above. Further, if the identified target 
protein is an enzyme, candidate compounds would likely interfere with the enzymatic 
properties of the target protein. For example, the enzymatic function of a target protein 
5 may be to serve as a protease, nuclease, phosphatase, dehydrogenase, transporter 
protein, transcriptional enzyme, and any other type of enzyme known or unknown. 
Thus, the present invention contemplates using the protein products described above to 
screen combinatorial chemical libraries. 

In one example, the target protein is a serine protease and the substrate of the 
10 enzyme is known. The present example is directed towards the analysis of libraries of 
compounds to identify compounds that function as inhibitors of the target enzyme. First, 
a library of small molecules is generated using methods of combinatorial library 
formation well known in the art. U.S. Patent Nos. 5,463,564 and 5,574, 656, to 
Agrafiotis, et al, entitled "System and Method of Automatically Generating Chemical 
15 Compounds with Desired Properties," the disclosures of which are incorporated herein 
by reference in their entireties, are two such teachings. Then the library compounds are 
screened to identify those compounds that possess desired structural and functional 
properties. U.S. Patent No. 5,684,71 1, the disclosure of which is incorporated herein by 
reference in its entirety, also discusses a method for screening libraries. 
20 To illustrate the screening process, the target polypeptide and chemical 

compounds of the library are combined with one another and permitted to interact with 
one another. A labeled substrate is added to the incubation. The label on the substrate is 
such that a detectable signal is emitted from metabolized substrate molecules. The 
emission of this signal permits one to measure the effect of the combinatorial library 
25 compounds on the enzymatic activity of target enzymes by comparint it to the signal 
emitted in the absence of combinatorial library compounds. The characteristics of each 
library compound are encoded so that compounds demonstrating activity against the 
enzyme can be analyzed and features common to the various compounds identified can 
be isolated and combined into future iterations of libraries. 
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Once a library of compounds is screened, subsequent libraries are generated 
using those chemical building blocks that possess the features shown in the first round of 
screen to have activity against the target enzyme. Using this method, subsequent 
iterations of candidate compounds will possess more and more of those structural and 

5 functional features required to inhibit the function of the target enzyme, until a group of 
enzyme inhibitors with high specificity for the enzyme can be found. These compounds 
can then be further tested for their safety and efficacy as antibiotics for use in mammals. 

It will be readily appreciated that this particular screening methodology is 
exemplary only. Other methods are well known to those skilled in the art. For 

10 example, a wide variety of screening techniques are known for a large number of 
naturally-occurring targets when the biochemical function of the target protein is 
known. For example, some techniques involve the generation and use of small peptides 
to probe and analyze target proteins both biochemically and genetically in order to 
identify and develop drug leads. Such techniques include the methods described in PCT 

15 publications No. W09935494, W09819162, W09954728, the disclosures of which are 
incorporated herein by reference in their entireties. 

Current cell-based assays used to identify or to characterize compounds for 
drug discovery and development frequently depend on detecting the ability of a test 
compound to modulate the activity of a target molecule located within a cell or located 

20 on the surface of a cell. Most often such target molecules are proteins such as 
enzymes, receptors and the like. However, target molecules may also include other 
molecules such as DNAs, lipids, carbohydrates and RNAs including messenger 
KNAs, ribosomal RNAs, tRNAs and the like. A number of highly sensitive cell-based 
assay methods are available to those of skill in the art to detect binding and interaction 

25 of test compounds with specific target molecules. However, these methods are 
generally not highly effective when the test compound binds to or otherwise interacts 
with its target molecule with moderate or low affinity. In addition, the target molecule 
may not be readily accessible to a test compound in solution, such as when the target 
molecule is located inside the cell or within a cellular compartment such as the 

30 periplasm of a bacterial cell Thus, current cell-based assay methods are limited in 
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that they are not effective in identifying or characterizing compounds that interact with 
their targets with moderate to low affinity or compounds that interact with targets that 
are not readily accessible. 

The cell-based assay methods of the present invention have substantial 
5 advantages over current cell-based assays. These advantages derive from the use of 
sensitized cells in which the level or activity of at least one proliferation-required gene 
product (the target molecule) has been specifically reduced to the point where the 
presence or absence of its function becomes a rate-determining step for cellular 
proliferation. In the cell-based assays of the present invention, the level or activity of 
10 a proliferation-required gene product is reduced by transcribing an antisense nucleic 
acid complementary to at least a portion of the nucleic acid encoding the gene product 
from one of the fusion promoters described herein. Such sensitized cells become 
much more sensitive to compounds that are active against the affected target molecule. 
Thus, cell-based assays of the present invention are capable of detecting compounds 
15 exhibiting low or moderate potency against the target molecule of interest because 
such compounds are substantially more potent on sensitized ceils than on non- 
sensitized cells. The effect may be such that a test compound may be two to several 
times more potent, at least 10 times more potent, at least 20 times more potent, at least 
50 times more potent, at least 100 times more potent, at least 1000 times more potent, 
20 or even more than 1000 times more potent when tested on the sensitized cells as 
compared to the non-sensitized cells. 

Due in part to the increased appearance of antibiotic resistance in pathogenic 
microorganisms and to the significant side-effects associated with some currently used 
antibiotics, novel antibiotics acting at new targets are highly sought after in the art. 
25 Yet, another limitation in the current art related to cell-based assays is the problem of 
repeatedly identifying hits against the same kinds of target molecules in the same 
limited set of biological pathways. This may occur when compounds acting at such 
new targets are discarded, ignored or fail to be detected because compounds acting at 
the "old" targets are encountered more frequently and are more potent than 
30 compounds acting at the new targets. As a result, the majority of antibiotics, in use 
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currently interact with a relatively small number of target molecules within an even 
more limited set of biological pathways. 

The use of sensitized cells comprising the fusion promoters of the current 
invention provides a solution to the above problem in two ways. First, desired 

5 compounds acting at a target of interest, whether a new target or a previously known 
but poorly exploited target, can now be detected above the "noise" of compounds 
acting at the "old" targets due to the specific and substantial increase in potency of 
such desired compounds when tested on the sensitized cells of the current invention. 
Second, the methods used to sensitize cells to compounds acting at a target of interest 

10 may also sensitize these cells to compounds acting at other target molecules within the 
same biological pathway. For example, expression of an antisense molecule to a gene 
encoding a ribosomal protein is expected to sensitize the cell to compounds acting at 
that ribosomal protein and may also sensitize the cells to compounds acting at any of 
the ribosomal components (proteins or rRNA) or even to compounds acting at any 

15 target which is part of the protein synthesis pathway. Thus an important advantage of 
the present invention is the ability to reveal new targets and pathways that were 
previously not readily accessible to drug discovery methods. 

Sensitized cells of the present invention are prepared by reducing the activity 
or level of a target molecule. The target molecule may be a gene product, such as an 

20 RNA or polypeptide produced from the proliferation-required nucleic acids described 
herein. Alternatively, the target may be a gene product such as an RNA or polypeptide 
which is produced from a sequence within the same operon as the proliferation- 
required nucleic acids described herein. In addition, the target may be an RNA or 
polypeptide in the same biological pathway as the proliferation-required nucleic acids 

25 described herein. Such biological pathways include, but are not limited to, 
enzymatic, biochemical and metabolic pathways as well as pathways involved in the 
production of cellular structures such the cell wall. 

Current methods employed in the arts of medicinal and combinatorial 
chemistries are able to make use of structure-activity relationship information derived 

30 from testing compounds in various biological assays including direct binding assays 
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and cell-based assays. Occasionally compounds are directly identified in such assays 
that are sufficiently potent to be developed as drugs. More often, initial hit 
compounds exhibit moderate or low potency. Once a hit compound is identified with 
low or moderate potency, directed libraries of compounds are synthesized and tested 
5 in order to identify more potent leads. Generally these directed libraries are 
combinatorial chemical libraries consisting of compounds with structures related to 
the hit compound but containing systematic variations including additions, 
subtractions and substitutions of various structural features. When tested for activity 
against the target molecule, structural features are identified that either alone or in 

10 combination with other features enhance or reduce activity. This information is used 
to design subsequent directed libraries containing compounds with enhanced activity 
against the target molecule. After one or several iterations of this process, compounds 
with substantially increased activity against the target molecule are identified and may 
be further developed as drugs. This process is facilitated by use of the sensitized cells 

15 of the present invention since compounds acting at the selected targets exhibit 
increased potency in such cell-based assays, thus; more compounds can now be 
characterized providing more useful information than would be obtained otherwise. 

Thus, it is now possible using cell-based assays of the present invention to 
identify or characterize compounds that previously would not have been readily 

20 identified or characterized including compounds that act at targets that previously 
were not readily exploited using cell-based assays. The process of evolving potent 
drug leads from initial hit compounds is also substantially improved by the cell-based 
assays of the present invention because, for the same number of test compounds, more 
structure- function relationship information is likely to be revealed. 

25 The method of sensitizing a cell entails selecting a suitable gene or operon. A 

suitable gene or operon is one whose expression is required for the proliferation of the 
cell to be sensitized. The next step is to introduce into the cells to be sensitized, an 
antisense RNA capable of hybridizing to the suitable gene or operon or to the RNA 
encoded by the suitable gene or operon. Introduction of the antisense RNA can be in 

30 the form of an expression vector in which antisense RNA is produced under the 
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control of a fusion promoter as described herein. The amount of antisense RNA 
produced is limited by varying the inducer concentration to which the cell is exposed 
and thereby varying the activity of the promoter driving transcription of the antisense 
RNA. Thus, cells are sensitized by exposing them to an inducer concentration that 

5 results in a sub-lethal level of antisense RNA expression. 

In one embodiment of the cell-based assays, an antisense nucleic acid that is 
complementary to a proliferation-required gene from a Gram-positive organism is 
used to inhibit the production of a proliferation-required protein. Expression vectors 
which contain an antisense nucleic acid complementary to identified genes required 

10 for proliferation operably linked to a fusion promoter of the present invention are used 
to limit the concentration of a proliferation-required protein without severely 
inhibiting growth. To achieve that goal, a growth inhibition dose curve of inducer is 
calculated by plotting various doses of inducer against the corresponding growth 
inhibition caused by the antisense expression. From this curve, various percentages of 

15 antisense induced growth inhibition, from 1 to 100% can be determined. If the fusion 
promoter contained in the expression vector contains a xylO linked to one of the 
promoters of SEQ ID NO.: 36 - 45, transcription in Staphylococcus aureus and 
Enterococcus faecalis will be regulatable by the xyl repressor and expression from the 
promoter can be induced with xylose. Similarly, IPTG, xylose, tetracycline, maltose, 

20 absence of tip and temperature inducible promoters may be used. For example, the 
highest concentration of the inducer that does not reduce the growth rate significantly 
can be estimated from the curve. Cellular proliferation can be monitored by growth 
medium turbidity via OD measurements. In another example, the concentration of 
inducer that reduces growth by 25% can be predicted from the curve. In still another 

25 example, a concentration of inducer that reduces growth by 50% can be calculated. 
Additional parameters such as colony forming units (cfu) can be used to measure 
cellular viability. 

Cells to be assayed are exposed to the above-determined concentrations of 
inducer. The presence of the inducer at this sub-lethal concentration reduces the 
30 amount of the proliferation required gene product to the lowest amount in the cell that 
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will support growth. Cells grown in die presence of this concentration of inducer are 
therefore specifically more sensitive to inhibitors of the proliferation-required protein 
or RNA of interest or to inhibitors of proteins or RNAs in the same biological pathway 
as the proliferation-required protein or RNA of interest but not to inhibitors of 

5 unrelated proteins or RNAs. 

Cells pretreated with sub-inhibitory concentrations of inducer and thus 
containing a reduced amount of proliferation-required target gene product are then 
used to screen for compounds that reduce cell growth. The sub-lethal concentration of 
inducer may be any concentration consistent with the intended use of the assay to 

10 identify candidate compounds to which the cells are more sensitive. For example, the 
sub-lethal concentration of the inducer may be such that growth inhibition is at least 
about 5%, at least about 8% at least about 10%, at least about 20%, at least about 
30%, at least about 40%, at least about 50%, at least about 60% at least about 75%, or 
more. Cells which are pre-sensitized using the preceding method are more sensitive to 

15 inhibitors of the target protein because these cells contain less target protein to inhibit 
than wild-type cells. 

R will be appreciated that the cell-based assays described above may be implemented 
in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting 
antisense sequences under the control of a fusion promoter described herein. It will 

20 also be appreciated that the above cell-based assays employing the fusion promoters 
described herein can be implemented in other Gram-positive organims, including but 
not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, 
Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, 
Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 

25 monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides t Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis. Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 

In another embodiment of the cell-based assays of the present invention* the 

30 level or activity of a proliferation required gene product is reduced using a mutation, 
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such as a temperature sensitive mutation, in the proliferation-required sequence and an 
antisense nucleic acid complementary to the proliferation-required sequence which is 
transcribed from one of the fusion promoters described herein. Growing the cells at 
an intermediate temperature between the permissive and restrictive temperatures of 

5 the temperature sensitive mutant where the mutation is in a proliferation-required gene 
produces cells with reduced activity of the proliferation-required gene product The 
antisense RNA complementary to the proliferation-required sequence further reduces 
the activity of the proliferation required gene product. Drugs that may not have been 
found using either the temperature sensitive mutation or the antisense nucleic acid 

10 alone may be identified by determining whether cells in which expression of the 
antisense nucleic ^acid has been induced and which are grown at a temperature 
between the permissive temperature and the restrictive temperature are substantially 
more sensitive to a test compound than cells in which expression of the antisense 
nucleic acid has not been induced and which are grown at a permissive temperature. 

15 Also drugs found previously from either the antisense nucleic acid alone or the 
temperature sensitive mutation alone may have a different sensitivity profile when 
used in cells combining the two approaches, and that sensitivity profile may indicate a 
more specific action of the drug in inhibiting one or more activities of the gene 
product. 

20 Temperature sensitive mutations may be located at different sites within the 

gene and correspond to different domains of the protein. For example, the dnaB gene 
of Escherichia coli encodes the replication fork DNA helicase. DnaB has several 
domains, including domains for oiigomerization, ATP' hydrolysis, DNA binding, 
interaction with primasc, interaction with DnaC, and interaction with DnaA [(Biswas, 

25 E.E. and Biswas, S.B. 1999, 'Mechanism and DnaB helicase of Escherichia coli: 
structural domains involved in ATP hydrolysis, DNA binding, and oiigomerization" 
Biochem. 38: 10919-10928; Hiasa, H. and Marians, KJ. 1999, "Initiation of 
bidirectional replication at the chromosomal origin is directed by the interaction 
. between helicase and primase" J, Biol Chem. 274: 27244-27248; San Martin, C, et 

30 al 1998, 'Three-dimensional reconstructions from cryoelectron microscopy images 
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reveal an intimate complex between helicase DnaB and its loading partner DnaC 
Structure 6:501-9; Sutton, M.D. et al 1998. "Escherichia coli DnaA protein. The N- 
tenninal domain and loading of DnaB helicase at the E. coli chromosomal origin" J. 
Biol. Chem. 273: 34255-620, the disclosures of which are incorporated herein by 
5 reference in their entireties]. Temperature sensitive mutations in different domains of 
DnaB confer different phenotypes at the restrictive temperature, which include either 
an abrupt stop or slow stop in DNA replication with or without DNA breakdown 
(Wechsler, J. A. and Gross, J.D. 1971. ''Escherichia coli mutants temperature-sensitive 
for DNA synthesis ,, Mol Gen. Genetics 113: 273-284, the disclosure of which is 

10 incorporated herein by reference in its entirety) and termination of growth or cell 
death. Combining the use of temperature sensitive mutations in the dnaB gene that 
cause cell death at the restrictive temperature with an antisense to the dnaB gene could 
lead to the discovery of very specific and effective inhibitors of one or a subset of 
activities exhibited by DnaB. 

15 It will be appreciated that the above method may be performed with any 

mutation which reduces but does not eliminate the activity or level of the gene product 
which is required for proliferation. 

When screening for antimicrobial agents against a gene product required for 
proliferation, growth inhibition of cells containing a limiting amount of that 

20 proliferation-required gene product can be assayed. Growth inhibition can be 
measured by directly comparing the amount of growth, measured by the optical 
density of the growth medium, between an experimental sample and a control sample. 
Alternative methods for assaying cell proliferation include measuring the activity of 
any of a variety of reporter genes, various enzymatic activity assays, and other 

25 methods well known in the art. 

It will be appreciated that the above method may be performed in solid phase, 
liquid phase or a combination of the two. For example, cells grown on nutrient agar 
containing the inducer of the antisense construct may be exposed to compounds 
spotted onto the agar surface. A compound's effect may be judged from the diameter 

30 of the resulting killing zone, the area around the compound application point in which 



-140- 



WO 02/051982 



PCT/USOL/50250 



cells do not grow. Multiple compounds may be transferred to agar plates and 
simultaneously tested using automated and semi-automated equipment including but 
not restricted to multi-channel pipettes (for example the Beckman Multimek) and 
multi-channel spotters (for example the Genomic Solutions Flexys). In this way 

5 multiple plates and thousands to millions of compounds may be tested per day. 

The compounds may also be tested entirely in liquid phase using microtiter 
plates as described below. Liquid phase screening may be performed in microtiter 
plates containing 96, 384, 1536 or more wells per microtiter plate to screen multiple 
plates and thousands to millions of compounds per day. Automated and semi- 

10 automated equipment may be used for addition of reagents (for example cells and 
compounds) and determination of cell density. 

The examples below describe cell-based assays that exploit the fusion 
promoters described herein to sensitize cells and allow for the discovery of a diverse 
class of antibiotics. Example 20 describes experiments performed in E. coli, which 

15 verify the effectiveness of the cell-based approach described above. 

EXAMPLE 20 

Cell-Based Assay Using Antisense RNA Complementary to Genes Encoding 

Ribosomal Proteins 

20 The effectiveness of the above cell-based assay was validated using constructs 

expressing antisense RNA to the proliferation required E. coli genes rplL> rpU, and 
rplW encoding ribosomal proteins L7/L12, L10 and L23 respectively. These proteins 
are part of the protein synthesis apparatus of the cell and as such are required for 
proliferation. These constructs were used to test the effect of antisense expression on 

25 cell sensitivity to antibiotics known to bind to the ribosome and thereby inhibit protein 
synthesis. Constructs expressing antisense RNA to several other genes (elaD, visC, 
yohH y and atpE/B), the products of which are not involved in protein synthesis were 
used for comparison. 

First, pLex5BA (Krause et al 1997. 7. Mol Biol 274: 365, the disclosure of 

30 which is incorporated herein by reference in its entirety) expression vectors containing 
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antisense constructs to cither rplWor to elaD were introduced into separate is. coli cell 
populations. Vector introduction is a technique well known to those of ordinary skill 
in the art. The expression vectors of this example contain IPTG inducible promoters 
that drive the expression of the antisense RNA in the presence of the inducer. 

5 However, those skilled in the art will appreciate that other inducible promoters may 
also be used. Suitable expression vectors are also well known in the art. The E. coli 
antisense clones to genes encoding ribosomal proteins L7/L12, L10 and L23 were 
used to test the effect of antisense expression on cell sensitivity to the antibiotics 
known to bind to these proteins. Expression vectors containing antisense to either the 

10 genes encoding L7/L12 and L10 or L23 were introduced into separate E. coli cell 
populations. 

The cell populations were exposed to a range of IPTG concentrations in liquid 
medium to obtain the growth inhibitory dose curve for each clone. First, seed cultures 
were grown to a particular turbidity that is measured by the optical density (OD) of the 

15 growth solution. The OD of the solution is directly related to the number of bacterial 
cells contained therein. Subsequently, sixteen 200 ul liquid medium cultures were 
grown in a 96 well microtiter plate at 37° C with a range of IPTG concentrations in 
duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final concentration). 
Additionally, control cells were grown in duplicate without IPTG. These cultures 

20 were started from equal amounts of cells derived from the same initial seed culture of 
a clone of interest. The cells were grown for up to 15 hours and the extent of growth 
was determined by measuring the optical density of the cultures at 600 nm. When the 
control culture reached mid-log phase the percent growth (relative to the control 
culture) for each of the IPTG containing cultures was plotted against the log 

25 concentrations of IP TG to produce a growth inhibitory dose response curve for the 
IPTG. The concentration of IPTG that inhibits cell growth to 50% (IC 5 o) as compared 
to the 0 mM IPTG control (0% growth inhibition) was then calculated from the curve. 
Under these conditions, an amount of antisense RNA was produced that reduced the 
expression levels of rplW and elaD to a degree such that growth was inhibited by 

30 50%. 
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Alternative methods of measuring growth are also contemplated Examples of 
these methods include measurements of proteins, the expression of which is 
engineered into the cells being tested and can readily be measured. Examples of such 
proteins include green fluorescent protein (GFP) and various enzymes. 

5 Cells were pretreated with the selected concentration of IPTG and then used to 

test the sensitivity of cell populations to tetracycline, erythromycin and other protein 
synthesis inhibitors. FIG. 12 is an IPTG dose response curve in E. coli transformed 
with an EPTG-inducible plasmid containing either an antisense clone to the £. coli 
rplW gene (AS-rplW) which encodes ribosomal protein L23 which is required for 

10 protein synthesis and essential for cell proliferation, or an antisense clone to the elaD 
(AS-elaD) gene which is not known to be involved in protein synthesis and which is 
also essential for proliferation. 

An example of a tetracycline dose response curve is shown in FIGs. 13A and 
13B for the rplW and elaD genes, respectively. Cells were grown to log phase and 

15 then diluted into media alone or media containing IPTG at concentrations which give 
20% and 50% growth inhibition as determined by IPTG dose response curves. After 
2.5 hours, the cells were diluted to a final ODeoo of 0.002 into 96 well plates 
containing (1) +/- IPTG at the same concentrations used for the 2.5 hour pre- 
incubation; and (2) serial two-fold dilutions of tetracycline such that the final 

20 concentrations of tetracycline range from 1 ^ig/ml to 15.6 ng/ml and 0 ng/ml. The 96 
well plates were incubated at 37°C and the OD 600 was read by a plate reader every 5 
minutes for up to 15 hours. For each IPTG concentration and the no IPTG control, 
tetracycline dose response curves were determined when the control (absence of 
tetracycline) reached 0.1 ODeoo. To compare tetracycline sensitivity with and without 

25 IPTG, tetracycline IC 50s were determined from the dose response curves (FIGs. 13A 
and 13B). Cells with reduced levels of L23 (AS-rplW) showed increased sensitivity to 
tetracycline (FIG. 13A) as compared to cells with reduced levels of the elaD gene 
product (AS-elaD) (FIG. 13B). FIG. 14 shows a summary bar chart in which the 
ratios of tetracycline IC 50 s determined in the presence of IPTG which gives 50% 

30 growth inhibition versus tetracycline ICsos determined without IPTG (fold increase in 
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tetracycline sensitivity) were plotted. Cells with reduced levels of either L7/L12 
(encoded by genes rplL, rpU) or L23 (encoded by the rplW gene) showed increased 
sensitivity to tetracycline (FIG. 14). Cells expressing antisense to genes not known to 
be involved in protein synthesis (AS-atpB/E, AS-raC, AS-elaD 9 AS-yohH) did not 
5 show the same increased sensitivity to tetracycline, validating the specificity of this 
assay (FIG. 14). 

In addition to the above, it has been observed in initial experiments that clones 
expressing antisense RNA to genes involved in protein synthesis (including genes 
encoding ribosomai proteins L7/L12 & L10 s L7/L12 alone, L22, and LI 8, as well as 

10 genes encoding rRNA and Elongation Factor G) have increased sensitivity to the 
macrolide, erythromycin, whereas clones expressing antisense to the non-protein 
synthesis genes elaD, atpB/E and visC do not. Furthermore, the clone expressing 
antisense to rplL and rpU does not show increased sensitivity to nalidixic acid and 
ofloxacin, antibiotics which do not inhibit protein synthesis. 

15 The results with the ribosomai protein genes rplL, rpU, and rplWzs well as the 

initial results using various other antisense clones and antibiotics show that limiting 
the concentration of an antibiotic target makes cells more sensitive to the 
antimicrobial agents that specifically interact with that protein. The results also show 
that these cells are sensitized to antimicrobial agents that inhibit the overall function in 

20 which the protein target is involved but are not sensitized to antimicrobial agents that 
inhibit other functions. 

It will be appreciated that the cell-based assays described above may be implemented 
in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting 
antisense sequences under the control of a fusion promoter described herein. It will 

25 also be appreciated that the above cell-based assays employing the fusion promoters 
described herein can be implemented in other Gram-positive organims, including but 
not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, 
Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, 
Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 

30 monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
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asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 

The cell-based assay described above may also be used to identify the 
5 biological pathway in which a proliferation-required nucleic acid or its gene product 
lies. In such methods, cells transcribing a sub-lethal level of antisense to a target 
proliferation-required nucleic acid from the fusion promoters described herein and 
control cells in which expression of the antisense has not been induced are contacted 
with a panel of antibiotics known to act in various pathways. If the antibiotic acts in 

10 the pathway in which the target proliferation-required nucleic acid or its gene product 
lies, cells in which expression of the antisense has been induced will be more sensitive 
to the antibiotic than cells in which expression of the antisense has not been induced. 

As a control, the results of the assay may be confirmed by contacting a panel of 
cells expressing antisense nucleic acids to many different proliferation-required genes 

15 including the target proliferation-required gene. If the antibiotic is acting specifically, 
heightened sensitivity to the antibiotic will be observed only in the cells expressing 
antisense to a target proliferation-required gene (or cells expressing antisense to other 
proliferation-required genes in the same pathway as the target proliferation-required 
gene) but will not be observed generally in all cells expressing antisense to 

20 proliferation-required genes. 

It will be appreciated that the cell-based assays described above may be 
implemented in Staphylococcus aureus and Enterococcus faecalis using proliferation- 
inhibiting antisense sequences under the control of a fusion promoter described herein. 
It will also be appreciated that the above cell-based assays employing the fusion 

25 promoters described herein can be implemented in other Gram-positive organims, 
including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium 
difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, 
Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 

30 asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
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xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 

Similarly, the above method may be used to determine the pathway on which a 
test compound, such as a test antibiotic acts. A panel of cells, each of which 

5 transcribes antisense to a proliferation-required nucleic acid in a known pathway from 
one of the fusion promoters described herein, is contacted with a compound for which 
it is desired to determine the pathway on which it acts. The sensitivity of the panel of 
cells to the test compound is determined in cells in which expression of the antisense 
has been induced and in control cells in which expression of the antisense has not 

10 been induced. If the test compound acts on the pathway on which an antisense nucleic 
acid acts, cells in which expression of the antisense has been induced will be more 
sensitive to the compound than cells in which expression of the antisense has not been 
induced. In addition, control cells in which expression of antisense to proliferation- 
required genes in other pathways has been induced will not exhibit heightened 

15 sensitivity to the compound. In this way, the pathway on which the test compound 
acts may be determined. 

It will be appreciated that the cell-based assays described above may be implemented 
in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting 
antisense sequences under the control of a fusion promoter described herein. It will 

20 also be appreciated that the above cell-based assays employing the fusion promoters 
described herein can be implemented in other Gram-positive organims, including but 
not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, 
Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae t 
Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 

25 monocytogenes. Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 

The Example below provides one method for performing such assays. 

30 
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EXAMPLE 21 

^purification of the Pathway in which a Proliferation-Required 
Gene Lies or the Pathway on which an Antibiotic Acts 
A- Preparation of Bacterial Stocks for Assay 

5 To provide a consistent source of cells to screen, frozen stocks of host bacteria 

containing the desired antisense construct are prepared using standard microbiological 
techniques. For example, a single clone of the microorganism can be isolated by 
streaking out a sample of the original stock onto an agar plate containing nutrients for 
cell growth and an antibiotic for which the antisense construct contains a gene which 

10 confers resistance. After overnight growth an isolated colony is picked from the plate 
with a sterile needle and transferred to an appropriate liquid growth media containing 
the antibiotic required for maintenance of the plasmid. The cells are incubated at 
30°C to 37°C with vigorous shaking for 4 to 6 hours to yield a culture in exponential 
growth. Sterile glycerol is added to 15% (volume to volume) and lOO^L to 500 |iL 

15 aliquots are distributed into sterile cryotubes, snap frozen in liquid nitrogen, and 
stored at -80°C for future assays. 
B. Growth of Bacteria for Use in the Assay 

A day prior to an assay, a stock vial is removed from the freezer, rapidly 
thawed (37°C water bath) and a loop of culture is streaked out on an agar plate 

20 containing nutrients for cell growth and an antibiotic to which the antisense construct 
confers resistance. After overnight growth at 37°C, ten randomly chosen, isolated 
colonies are transferred from the plate (sterile inoculum loop) to a sterile tube 
containing 5 mL of appropriate medium containing the antibiotic to which the 
antisense vector confers resistance. After vigorous mixing to form a homogeneous cell 

25 suspension, the optical density of the suspension is measured at 600 nm (ODaoo) and if 
necessary an aliquot of the suspension is diluted into a second tube of 5 mL, sterile, 
appropriate medium plus antibiotic to achieve an ODeoo ^ 0.02 absorbance units. The 
culture is then incubated at 37° C for t-2 hrs with shaking until the OD600 reaches OD 
0.2 - 0.3. At this point the cells are ready to be used in the assay. 

30 C. Selection of Media to be Used in Assay 
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Two-fold dilution series of the inducer are generated in culture media 
containing the appropriate antibiotic for maintenance of the antisense construct 
Several media are tested side by side and three to four wells are used to evaluate the 
effects of the inducer at each concentration in each media. For example, LB broth, 

5 TBD broth, Muller-Hinton, or other media may be tested with the inducer xylose at 
the following concentrations, 5 mM, 10 mM, 20 mM, 40 mM, 80 mM, 120 mM and 
160 mM. Equal volumes of test media-inducer and cells are added to the wells of a 
384 well microtiter plate and mixed. The cells are prepared as described above and 
diluted 1:100 in the appropriate media containing the test antibiotic immediately prior 

10 to addition to the microtiter plate wells. For a control, cells are also added to several 
wells of each media that do not contain inducer, for example 0 mM xylose. Cell 
growth is monitored continuously by incubation at 37°C in a microtiter plate reader 
monitoring the OD 6 oo of the wells over an 18-hour period. The percent inhibition of 
growth produced by each concentration of inducer is calculated by comparing the rates 

15 of logarithmic growth against that exhibited by cells growing in media without 
inducer. The medium yielding greatest sensitivity to inducer is selected for use in the 
assays described below. 

D. Measurement of Test Antibiotic Sensitivity in the Absence of Antisense Construct 
Induction 

20 Two-fold dilution series of antibiotics of known mechanism of action are 

generated in the culture media selected for further assay development that has been 
supplemented with the antibiotic used to maintain the construct. A panel of test 
antibiotics known to act on different pathways is tested side by side with three to four 
wells being used to evaluate the effect of a test antibiotic on cell growth at each 

25 concentration. Equal volumes of test antibiotic and cells are added to the wells of a 
384 well microtiter plate and mixed. Cells are prepared as described above using the 
media selected for assay development supplemented with die antibiotic required to 
maintain the antisense construct and are diluted 1:100 in identical media immediately 
prior to addition to the microtiter plate wells. For a control, cells are also added to 

30 several wells that lack antibiotic, but contain the solvent used to dissolve the 
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antibiotics. Cell growth is monitored continuously by incubation at 37°C in a 
microtiter plate reader monitoring the ODeoo of the wells over an 18-hour period. The 
percent inhibition of growth produced by each concentration of antibiotic is calculated 
by comparing the rates of logarithmic growth against that exhibited by cells growing 
5 in media without antibiotic. A plot of percent inhibition against log[antibiotic 
concentration] allows extrapolation of an IC50 value for each antibiotic. 
E. Measurement of Test Antibiotic Sensitivity in the Presence of Antisense Construct 
Inducer 

The culture media selected for use in the assay is supplemented with inducer at 

10 concentrations shown to inhibit cell growth by 50% and 80% as described above, as 
well as the antibiotic used to maintain the construct. Two fold dilution series of the 
panel of test antibiotics used above are generated in each of these media. Several 
antibiotics are tested side by side in each medium with three to four wells being used 
to evaluate the effects of an antibiotic on cell growth at each concentration. Equal 

15 volumes of test antibiotic and cells are added to the wells of a 384 well microtiter 
plate and mixed. Cells are prepared as described above using the media selected for 
use in the assay supplemented with the antibiotic required to maintain the antisense 
construct. The cells are diluted 1:100 into two 50 mL aliquots of identical media 
containing concentrations of inducer that have been shown to inhibit cell growth by 

20 50% and 80 % respectively and incubated at 37°C with shaking for 2.5 hours. 
Immediately prior to addition to the microtiter plate wells, the cultures are adjusted to 
an appropriate ODeoo (typically 0.002) by dilution into warm (37°C) sterile media 
supplemented with identical concentrations of the inducer and antibiotic used to 
maintain the antisense construct. For a control, cells are also added to several wells 

25 that contain solvent used to dissolve test antibiotics but which contain no antibiotic. 
Cell growth is monitored continuously by incubation at 37°C in a microtiter plate 
reader monitoring the OD«)o of the wells over an 18-hour period. The percent 
inhibition of growth produced by each concentration of antibiotic is calculated by 
comparing the rates of logarithmic growth against that exhibited by cells growing in 

30 media without antibiotic. A plot of percent inhibition against log[antibiotic 
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concentration] allows extrapolation of an IC50 value for each antibiotic. 
R Determining the Specificity of the Test Antibiotics 

A comparison of the IC50S generated by antibiotics of known mechanism of 
action under antisense induced and non-induced conditions allows the pathway in 

5 which a proliferation-required nucleic acid lies to be identified. If cells expressing an 
antisense nucleic acid complementary to a proliferation-required gene are selectively 
sensitive to an antibiotic acting via a particular pathway, then the gene against which 
the antisense acts is involved in the pathway on which the antibiotic acts. 
G. Identification of Pathway in which a Test Antibiotic Acts 

10 As discussed above, the cell-based assay may also be used to determine the 

pathway against which a test antibiotic acts. In such an analysis, the pathways against 
which each member of a panel of antisense nucleic acids acts are identified as 
described above. A panel of ceils, each containing an inducible vector which 
transcribes an antisense nucleic acid complementary to a gene in a known 

15 proliferation-required pathway from one of the fusion promoters described herein, is 
contacted with a test antibiotic for which it is desired to determine the pathway on 
which it acts under inducing and non-inducing conditions. If heightened sensitivity is 
observed in induced cells expressing antisense complementary to a gene in a particular 
pathway but not in induced cells expressing antisense complementary to genes in other 

20 pathways, then the test antibiotic acts against the pathway for which heightened 
sensitivity was observed. 

One skilled in the art will appreciate that further optimization of the assay 
conditions, such as the concentration of inducer used to induce antisense expression 
and/or the growth conditions used for the assay (for example incubation temperature 

25 and media components) may further increase the selectivity and/or magnitude of the 
antibiotic sensitization exhibited. 

The following example confirms the effectiveness of the methods described 

above. 

30 EXAMPLE 22 
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Identification of the Biological Pathway in Which a Proliferation-Required Gene Lies 
The effectiveness of the above assays was validated using proliferation- 
required genes from E. coli which were identified using procedures similar to those 
described above. Antibiotics of various chemical classes and modes of action were 

5 purchased from Sigma Chemicals (St. Louis, MO). Stock solutions were prepared by 
dissolving each antibiotic in an appropriate aqueous solution based on information 
provided by the manufacturer. The final working solution of each antibiotic contained 
no more than 0.2% (w/v) of any organic solvent. To determine their potency against a 
bacterial strain engineered for expression of an antisense complementary to a 

10 proliferation-required 50S ribosomal protein, each antibiotic was serially diluted two 
or three fold in growth medium supplemented with the appropriate antibiotic for 
maintenance of the anti-sense construct. At least ten dilutions were prepared for each 
antibiotic. 25 jiL aliquots of each dilution were transferred to discrete wells of a 384- 
well microplate (the assay plate) using a multi-channel pipette. Quadruplicate wells 

15 were used for each dilution of an antibiotic under each treatment condition (plus and 
minus inducer). Each assay plate contained twenty wells for cell growth controls 
(growth media replacing antibiotic), ten wells for each treatment (plus and minus 
inducer, in this example IPTG). Assay plates were usually divided into the two 
treatments: half the plate containing induced cells and an appropriate concentrations of 

20 inducer (in this example IPTG) to maintain the state of induction, the other half 
containing non-induced ceils in the absence of IPTG. 

Cells for the assay were prepared as follows. Bacterial cells containing a 
construct, from which expression of antisense nucleic acid complementary to rplL and 
rpU> which encode proliferation-required SOS ribosomal subunit proteins, is inducible 

25 in the presence of IPTG, were grown into exponential growth (OD^oo 0.2 to 0.3) and 
then diluted 1:100 into fresh media containing either 400 fiM or 0 jiM inducer (IPTG). 
These cultures were incubated at 37° C for 2.5 hr. After a 2.5 hr incubation, induced 
and non-induced cells were respectively diluted into an assay medium at a final ODox) 
value of 0.0004. The medium contained an appropriate concentration of the antibiotic 

30 for the maintenance of the antisense construct. In addition, the medium used to dilute 
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induced cells was supplemented with 800 \xM IPTG so that addition to the assay plate 
would result in a final IPTG concentration of 400 jiM. Induced and non-induced cell 
suspensions were dispensed (25 jil/well) into the appropriate wells of the assay plate 
as discussed previously. The plate was then loaded into a plate reader, incubated at 

5 constant temperature, and cell growth was monitored in each well by the measurement 
of light scattering at 595 nm. Growth was monitored every 5 minutes until the cell 
culture attained a stationary growth phase. For each concentration of antibiotic, a 
percentage inhibition of growth was calculated at the time point corresponding to mid- 
exponential growth for the associated control wells (no antibiotic, plus or minus 

10 IPTG). For each antibiotic and condition (plus or minus IPTG), a plot of percent 
inhibition versus log of antibiotic concentration was generated and the IC 5 o 
determined. A comparison of the IC50 for each antibiotic in the presence and absence 
of IPTG revealed whether induction of the antisense construct sensitized the ceil to the 
mechanism of action exhibited by the antibiotic. Cells which exhibited a significant 

15 (standard statistical analysis) numerical decrease in the IC50 value in the presence of 
inducer were considered to have an increased sensitivity to the test antibiotic. 

The above results demonstrate that induction of an antisense RNA to genes 
encoding 50S ribosomal subunit proteins results in a selective and highly significant 
sensitization of cells to antibiotics that inhibit ribosomal function and protein 

20 synthesis. The above results further demonstrate that induction of an antisense 
construct to an essential gene sensitizes a microorganism to compounds that interfere 
with that gene product's biological role. This sensitization is restricted to compounds 
that interfere with pathways associated with the targeted gene and its product. It will 
be appreciated that the cell-based assays described above may be implemented in 

25 Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting 
antisense sequences under the control of a fusion promoter described herein.. 

The example below describes an analysis performed in Staphylococcus aureus. 

EXAMPLE 23 
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Identification of the Biological Pathway in which a Gene Required for Proliferation of 

Staphylococcus aureus Lies 
Sensitized cells were also used to identify the pathway in which a gene 
required for proliferation of Staphylococcus aureus lies. Antibiotics of various 
5 chemical classes and modes of action were purchased from chemical suppliers, for 
example Sigma Chemicals (St. Louis, MO). Stock solutions were prepared by 
dissolving each antibiotic in an appropriate aqueous solution based on information 
provided by the manufacturer. The final working solution of each antibiotic contained 
no more than 0.2% (w/v) of any organic solvent. 

10 To determine its potency against a bacterial strain containing an antisense 

nucleic acid complementary to the sequence encoding the Beta (P) subunit of gyrase 
(which is required for proliferation) under the control of the Xyl-T5 fusion promoter, 
each antibiotic was serially diluted two or three fold in growth medium supplemented 
with the appropriate antibiotic for maintenance of the anti-sense construct. At least 

15 ten dilutions were prepared for each antibiotic. 

Aliquots (25 ^L) of each dilution were transferred to discrete wells of a 384- 
well microplate (the assay plate) using a multi-channel pipette. Quadruplicate wells 
were used for each dilution of an antibiotic under each treatment condition (plus and 
minus inducer). Each assay plate retained twenty wells for cell growth controls 

20 (growth media replacing antibiotic), ten wells for each treatment (plus and minus 
inducer, in this example xylose). Half the assay plate contained induced cells (in this 
example Staphylococcus aureus cells) and appropriate concentrations of inducer (in 
this example xylose) to maintain the state of induction while the other half of the assay 
plate contained non-induced cells in the absence of inducer. 

25 Cells of a bacterial clone containing a construct in which expression of 

antisense complementary to the sequence encoding the P subunit of gyrase under the 
control of a xylose inducible promoter were grown into exponential growth (OD^o 0.2 
to 0.3) and then diluted 1:100 into fresh media containing either 12 mM or 0 mM 
inducer (xylose). These cultures were incubated at 37° C for 2.5 hr. The presence of 

30 inducer (xylose) in the medium initiates and maintains production of antisense RNA 



-153- 



WO 02/051982 



PCT/USO 1/50250 



from the antisense construct, in this case. After a 2.5 hr incubation, induced and non- 
induced cells were respectively diluted into an assay medium containing an 
appropriate concentration of the antibiotic for the maintenance of the anti-sense 
construct. In addition, medium used to dilute induced cells was supplemented with 24 
5 mM xylose so that addition to the assay plate would result in a final xylose 
concentration of 12 mM. The cells were diluted to a final OD600 value of 0.0004. 

Induced and non-induced cell suspensions were dispensed (25 :l/well) into the 
appropriate wells of the assay plate as discussed previously. The plate was then 
loaded into a plate reader and incubated at constant temperature while cell growth was 

10 monitored in each well by the measurement of light scattering at 595 nm. Growth was 
monitored every 5 minutes until the cell culture attained a stationary growth phase. 
For each concentration of antibiotic, a percentage inhibition of growth was calculated 
at the time point corresponding to mid-exponential growth for the associated control 
wells (no antibiotic, plus or minus xylose). For each antibiotic and condition (plus or 

15 minus xylose), plots of percent inhibition versus Log of antibiotic concentration were 
generated and ICsos determined. 

A comparison of each antibiotic's IC50 in the presence and absence ofinducer 
(in this example xylose) reveals whether induction of the antisense construct 
sensitized the cell to the antibiotic's mechanism of action. If the antibiotic acts 

20 against the P subunit of gyrase, the IC50 of induced cells will be significantly lower 
than the IC50 of uninduced cells. 

FIG. 15 lists the antibiotics tested, their targets, and their fold increase in 
potency between induced cells and uninduced cells. As illustrated in FIG. 15, the 
potency of cefotaxime, cefoxitin, fiisidic acid, lincomycin, tobramycin, trimethoprim 

25 and vancomycin, each of which act on targets other than the p subunit of gyrase, was 
not significantly different in induced cells as compared to uninduced cells. However, 
the potency of novobiocin, which acts against the P subunit of gyrase, was 
significantly different between induced cells and uninduced cells. 

Thus, induction of an antisense nucleic acid complementary to the sequence 

30 encoding the P subunit of gyrase results in a selective and significant sensitization of 
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Staphylococcus aureus cells to an antibiotic which inhibits the activity of this protein. 
Furthermore, the results demonstrate that induction of an antisense construct to an 
essential gene sensitizes a microorganism to compounds that interfere with that gene 
product's biological role. This sensitization is apparently restricted to compounds that 
5 interfere with the targeted gene and its product. 

It will be appreciated that the cell-based assays described above may be implemented 
in Enterococcus faecalis using proliferation-inhibiting antisense sequences under the 
control of a fusion promoter described herein. It will also be appreciated that the 
above cell-based assays employing the fusion promoters described herein can be 

10 implemented in other Gram-positive organims, including but not limited to, Bacillus 
anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, 
Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus 
faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, 
Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, 

15 Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, 
Streptococcus mutans or any species falling within the genera of any of the above 
species. 

Assays utilizing antisense constructs to essentia] genes can be used to identify 

compounds that interfere with the activity of those gene products. Such assays could 
20 be used to identify drug leads, for example antibiotics. 

Assays utilizing antisense constructs to essential genes can be used to identify 

compounds that specifically interfere with the activity of multiple targets in a pathway. 

Such constructs can be used to simultaneously screen a sample against multiple targets 

in one pathway in one reaction (Combinatorial HTS). 
25 Furthermore, as discussed above, panels of antisense construct-containing cells 

may be used to characterize the point of intervention of any compound affecting an 

essential biological pathway including antibiotics with no known mechanism of 

action. 

The following example provides the a method for determining the pathway on 
30 which an antibiotic acts. 
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EXAMPLE 24 

Cell-based assays us in g Promoter Replacement or Operator Insertion 
A gene encoding a gene product required for cellular proliferation is placed 

5 under the control of an inducible promoter, such as one of the inducible fusion 
promoters described herein, using the promoter replacement or operator insertion 
methods described above. Sensitized cells are generated by growing cells in which the 
gene encoding the gene product required for proliferation is under control of the 
inducible fusion promoter in medium containing varying concentrations of inducer 

10 which are less than the inducer concentration in which the promoter has maximal 
activity. The sensitized cells are contacted with candidate compounds to be tested for 
the ability to inhibit proliferation. In some embodiments, the candidate compounds 
may be compounds in a library of compounds generated using combinatorial 
chemistry techniques or compounds in a natural products library. The extent of 

1 5 proliferation of the sensitized cells is compared to the extent of proliferation of control 
cells in which the activity or level of the gene product is higher than in the sensitized 
cells. For example, the control cells may be cells in which the inducible fusion 
promoter is fully active. A compound which inhibits the growth of the sensitized cells 
to a substantially greater extent than the control cells is identified. This compound 

20 may be used as a drug or further optimized to generate a more potent compound. 

It will be appreciated that the cell-based assays described above may be implemented 
in Staphylococcus aureus and Enterococcus faecalis using the fusion promoters or 
operators described herein. It will also be appreciated that the above cell-based assays 
employing the fusion promoters or operators described herein can be implemented in 

25 other Gram-positive organims, including but not limited to, Bacillus anthracis, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
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epidermidis. Staphylococcus xylosis, Streptococcus pneumoniae. Streptococcus 
mutans or any species felling within the genera of any of the above species. 

The example below describes an approach to determine if an antisense nucleic 
acid known to inhibit cell proliferation of one organism can successfully inhibit cell 
5 proliferation of a second organism. 

EXAMPLE 25 

Identification of Proliferation-Required Genes in Using Antisense Nucleic Acids from 

Heterologous Organisms 

10 Once an antisense nucleic acid which inhibits the proliferation of a Gram- 

positive organism has been identified using the fusion promoters described herein, the 
antisense nucleic acid may be used to identify proliferation-required genes in 
heterologous organisms. In such methods, the antisense nucleic acid is transcribed in 
the heterologous organism and its ability to inhibit proliferation of the heterologous 

1 5 organism is determined. 

In such embodiments, the antisense nucleic acids are inserted into expression 
vectors functional in the organisms in which the antisense nucleic acids are evaluated. 
Those skilled in the art will appreciate that a negative result in a heterologous 
microorganism does not mean that that microorganism is missing that gene nor does it 

20 mean that the gene is unessential. However, a positive result means that the 
heterologous microorganism contains a homologous gene which is required for 
proliferation of that microorganism. The homologous gene may be obtained using the 
methods described herein. Those cells that are inhibited by antisense may be used in 
cell-based assays as described herein for the identification and characterization of 

25 compounds in order to develop antibiotics effective in these microorganisms. Those 
skilled in the art will appreciate that an antisense molecule that works in the 
microorganism from which it was obtained will not always work in a heterologous 
microorganism. 

The antisense nucleic acids that inhibit the growth of Staphylococcus aureus or 
30 Enterococcus faecalis, for example, may also be evaluated for their ability to inhibit 
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the growth of microorganisms other than Staphylococcus aureus or Enterococcus 
faecalis. In particular, the ability of the antisense nucleic acid to inhibit the growth of 
Aspergillus jumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter 
jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia trachomatus, 
5 Clostridium botulinum, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, 
Neisseria gonorrhoeae, Nocardia asteroides, Pseudomonas aeruginosa, Salmonella 
cholerasuis, Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, 

10 Staphylococcus aureus, Klebsiella pneumoniae, Moxarella catarrhalis, Shigella 
boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas 
aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema 
pallidum, Yersinia pestis, or any species falling within the genera of any of the above 
species may be evaluated. 

15 In such methods, expression vectors in which the expression of an antisense 

nucleic acid that inhibits the growth of Staphylococcus aureus or Enterococcus 
faecalis is under the control of an inducible promoter, such as a fusion promoter 
described herein, are introduced into the microorganisms in which they are to be 
evaluated. In some embodiments, the antisense nucleic acids may be evaluated in 

20 microorganisms which are closely related to Staphylococcus aureus or Enterococcus 
faecalis. The ability of these antisense nucleic acids to inhibit the growth of the 
related microorganisms in the presence of the inducer is then measured. 

In one example, thirty-nine antisense nucleic acids which inhibited the growth 
of Staphylococcus aureus were identified using methods such as those described 

25 herein and were inserted into an expression vector such that their expression was 
under the control of a xylose-inducible Xyl-T5 promoter. A vector with a reporter 
gene under control of the Xyl-T5 promoter was used to show that expression from the 
Xyl-T5 promoter in Staphylococcus epidermidis was comparable to that in 
Staphylococcus aureus. 

30 The vectors were introduced into Staphylococcus epidermidis by 
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electroporation as follows: Staphylococcus epidermidis was grown in liquid culture to 
mid-log phase and then harvested by centrifugation. The cell pellet was resusp ended 
in 1/3 culture volume of ice-cold EP buffer (0.625 M sucrose, 1 mM MgCl 2 , pH=4.0), 
and then harvested again by centrifugation. The cell pellet was then resuspended with 

5 1/40 volume EP buffer and allowed to incubate on ice for 1 hour. The cells were then 
frozen for storage at -80°C. For electroporation, 50 p.1 of thawed electrocompetent 
cells were combined with 0.5 ng plasmid DNA and then subjected to an electrical 
pulse of 10 kV/cm, 25 uFarads, 200 ohm using a BioRad gene pulser electroporation 
device. The cells were immediately resuspended with 200 \xl outgrowth medium and 

10 incubated for 2 hours prior to plating on solid growth medium with drug selection to 
maintain the plasmid vector. Colonies resulting from overnight growth of these 
platings were selected, cultured in liquid medium with drug selection, and then 
subjected to dilution plating analysis as described above to test growth sensitivity in 
the presence of the inducer xylose. 

15 The results are shown in Table I below. The first column indicates the 

Molecule Number of the Staphylococcus aureus antisense nucleic acid which was 
introduced into Staphylococcus epidermidis. The second column indicates whether 
the antisense nucleic acid inhibited the growth of Staphylococcus epidermidis, with a 
indicating that growth was inhibited. Of the 39 Staphylococcus aureus antisense 

20 nucleic acids evaluated, 20 inhibited the growth of Staphylococcus epidermidis. 

TABLE! 

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit 
Proliferation of Staphylococcus aureus 

25 



Mol. No. 


S. epidermidis 


SaXA005 I 


+ 


SaXA007 


+ 


SaXA008 


+ 


SaXA009 


+ 


SaXAOlO 


+ 
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SaXAOll 


- 


SaXA012 


- 


SaXA013 


- 


SaXA015 


+ 


SaXA017 


- 


SaXA022 


+ 


SaXA023 


- 


SaXA024 


- 


SaXA025 


+ 


SaXA026 




SaXA027 


- 


SaXA027b 


- 


SaXA02c 


- 


SaXA028 


- 


SaXA029 


+ 


SaXA030 


+ 


SaXA032 


+ 


SaXA033 


+ 


SaXA034 


- 


SaXA035 


+ 


SaXA037 


+ 


SaXA039 


- 


SaXA042 


- 


SaXA043 




SaXA044 




SaXA045 


+ 


SaXA051 


+ 


SaXA053 




SaXA056b 
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SaXA059a 


+ 


SaXA060 




SaXA061 


+ 


SaXA062 


+ 


SaXA063 




SaXA065 





It will be appreciated above methods for identifying heterologous proliferation- 
required genes may be implemented in Enterococcus faecalis using the fusion 
promoters or operators described herein. It will be appreciated that the above methods 

5 for identifying heterologous proliferation-required genes using the fusion promoters 
described herein can be implemented in other Gram-positive organims, including but 
not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, 
Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, 
Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria 

10 monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia 
asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
xylosis t Streptococcus pneumoniae, Streptococcus mutans or any species falling 
within the genera of any of the above species. 

Those of skill in the art will appreciate that the methods disclosed herein 

15 provide a relatively easy, rapid, and economical way of identifying essential genes in 
an organism and identifying compounds that alter the expression levels of the 
identified essential genes. 

One skilled in the art will appreciate that these methods and devices are and 
may be adapted to carry out the objects and obtain the ends and advantages mentioned, 

20 as well as those inherent therein. The methods, procedures, and devices described 
herein are presently representative of preferred embodiments and are exemplary and 
are not intended as limitations on the scope of the invention. Changes therein and 
other uses will occur to those skilled in the art which are encompassed within the 
spirit of the invention and are defined by the scope of the disclosure. 
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It will be apparent to one skilled in the ait that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from 
the scope and spirit of the invention. 

Those skilled in the art recognize that the aspects and embodiments of the 
5 invention set forth herein may be practiced separate from each other or in conjunction 
with each other. Therefore, combinations of separate embodiments are within the 
scope of the invention as disclosed herein. 

All patents and publications mentioned in the specification are indicative of 
the levels of those skilled in the art to which the invention pertains. All patents and 
10 publications are herein incorporated by reference to the same extent as if each 
individual publication was specifically and individually indicated to be incorporated 
by reference. 

The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not specifically 

15 disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially of 1 and "consisting of may be replaced with 
either of the other two terms. The terms and expressions which have been employed 
are used as terms of description and not of limitation, and there is no intention that in 
the use of such terms and expressions indicates the exclusion of equivalents of the 

20 features shown and described or portions thereof. It is recognized that various 
modifications are possible within the scope of the invention disclosed. Thus, it should 
be understood that although the present invention has been specifically disclosed by 
preferred embodiments and optional features, modification and variation of the 
concepts herein disclosed may be resorted to by those skilled in the art, and that such 

25 modifications and variations are considered to be within the scope of this invention as 
defined by the disclosure. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups, those skilled in the art will recognize that the invention is also 
thereby described in terms of any individual member or subgroup of members of the 

30 Markush group. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid comprising a fusion promoter said fusion 
promoter comprising at least one promoter that is modified to have altered activity in 
at least one gram-positive organism said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and AclO, wherein said 
at least one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter. 

2. The isolated fusion promoter of Claim 1, wherein said at least one 
promoter is selected from the group consisting of SEQ ID NOs.: 36-45. 

3. The isolated nucleic acid of Claim I, wherein said at least one operator 

is xyiO. 

4. The isolated nucleic acid of Claim 3, wherein said at least one 
promoter is T5. 

5. The isolated nucleic acid of Claim 3, further comprising a second 
operator. 

6. The isolated nucleic acid of Claim 5, wherein said second operator is 

lacO. 

7. The isolated nucleic acid of Claim 1, wherein said fusion promoter is 
responsive to an inducer. 

8. The isolated nucleic acid of Claim 7, wherein said inducer is xylose. 

9. The isolated nucleic acid of Claim 1, wherein said fusion promoter is 
titratable. 

10. The isolated nucleic acid of Claim 1, wherein said at least one gram- 
positive organism is selected from the group consisting of Bacillus anthracis, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis t Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 
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11. The isolated nucleic acid of Claim 1, wherein said at least one gram- 
positive organism is Staphylococcus aureus. 

12. The isolated nucleic acid of Claim 1, wherein said at least one gram- 
positive organism is Enterococcus faecalis. 

5 13. An isolated nucleic acid comprising a fusion promoter said fusion 

promoter comprising at least one promoter selected from the group consisting of T5, 
CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and AclO, wherein said 
at least one operator is positioned such that binding of at least one repressor to said at 
10 least one operator represses transcription from said fusion promoter. 

14. The isolated nucleic acid of Claim 13, wherein said at least one 
operator is xylO. 

15. The isolated nucleic acid of Claim 14, wherein said at least one 
promoter is T5. 

15 16. The isolated nucleic acid of Claim 14, further comprising a second 

operator. 

17. The isolated nucleic acid of Claim 16, wherein said second operator is 

lacO. 

18. The isolated nucleic acid of Claim 13, wherein said fusion promoter is 
20 responsive to an inducer. 

19. The isolated nucleic acid of Claim 1 8, wherein said inducer is xylose. 

20. The isolated nucleic acid of Claim 13, wherein said fusion promoter is 
titratable. 

21. The isolated nucleic acid of Claim 13, wherein said at least one gram- 
25 positive organism is selected from the group consisting of Bacillus anihracis, 

Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium 
teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
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epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae. Streptococcus 
mutans and any species falling within the genera of any of the above species. 

22. The isolated nucleic acid of Claim 13, wherein said at least one gram- 
positive organism is Staphylococcus aureus. 
5 23. The isolated nucleic acid of Claim 13, wherein said at least one gram- 

positive organism is Enterococcus faecalis. 

24. An isolated fusion promoter comprising one of SEQ ID NO.: 26-35. 

25. A vector comprising the isolated nucleic acid of Claim 1 . 

26. A vector comprising the isolated nucleic acid of Claim 1 3. 
10 27. A vector comprising the isolated nucleic acid of Claim 24. 

28. The vector of Claim 25 further comprising at least one replicon 
selected from the group consisting of pl5a, pC194 and pCTl 138. 

29. The vector of Claim 25 further comprising a reporter gene operably 
linked to said fusion promoter. 

15 30. The vector of Claim 29, wherein said reporter gene is lacL-lacM. 

31. The vector of Claim 30, wherein lacLAacM is derived from 
Leuconostoc mesenteroides, 

32. The vector of Claim 25, wherein said at least one operator is xylO. 

33. The vector of Claim 32, wherein said at least one promoter is T5. 
20 34. The vector of Claim 32, further comprising a second operator. 

35. The vector of Claim 34, wherein said second operator is lacO. 

36. The vector of Claim 25, wherein said fusion promoter is responsive to 
an inducer. 

37. The vector of Claim 36, wherein said inducer is xylose. 

25 38. The vector of Claim 25, wherein said fusion promoter is titratable. 

39. The vector of Claim 25, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 

30 Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
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tuberculosis, Nocardia aster oides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

40. The vector of Claim 25, wherein said at least one gram-positive 
5 organism is Staphylococcus aureus. 

41. The vector of Claim 25, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

42. The vector of Claim 25, further comprising a random fragment of a 
microbial genome operably linked to said fusion promoter. 

10 43. The vector of Claim 25, further comprising a nucleic acid that encodes 

a peptide, wherein said nucleic acid is operably linked to said fusion promoter. 

44. The vector of Claim 25, further comprising a nucleic acid that is 

complementary to a portion of a microbial genome, wherein said nucleic acid is 

operably linked to said fusion promoter. 
15 45. The vector of Claim 25, further comprising a nucleic acid that encodes 

a molecule that inhibits the proliferation of microbe, wherein said nucleic acid is 

operably linked to said fusion promoter. 

46. A host cell comprising the nucleic acid of Claim 1 . 

47. A host cell comprising the nucleic acid of Claim 1 3 

20 48. A method for identifying genes involved in cellular proliferation said 

method comprising the steps of: 

(a) introducing into cells of a cell population a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter that is modified to have altered 

25 activity in at least one gram-positive organism said promoter being linked to at 

least one operator selected from the group consisting oixylO, tetO. trpO, malO 
and Ac 10, wherein said at least one operator is positioned such that binding of 
at least one repressor to said at least one operator represses transcription from 
said fusion promoter; 



! 
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(b) inducing transcription of said nucleic acid from said inducible 
fusion promoter, 

(c) identifying the cells in said cell population whose proliferation is 
reduced in response to the induction of transcription of said nucleic acid; and 

(d) identifying the gene from a cell identified in step (c) to which at 
least a portion of said nucleic acid is complementary. 

49. A method for identifying genes involved in cellular proliferation said 
method comprising the steps of: 

(a) introducing into cells of a cell population a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter selected from the group consisting 
of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least 
one operator selected from the group consisting of xylO, tetO, trpO, malO and 
XclO y wherein said at least one operator is positioned such that binding of at 
least one repressor to said at least one operator represses transcription from 
said fusion promoter; 

(b) inducing transcription of said nucleic acid from said inducible 
fusion promoter; 

(c) identifying the cells in said cell population whose proliferation is 
reduced in response to the induction of transcription of said nucleic acid; and 

(d) identifying the gene from a cell identified in step (c) to which at 
least a portion of said nucleic acid is complementary. 

50. The method of Claim 48, wherein said nucleic acid is random fragment 
of a microbial genome. 

51. The method of Claim 48, wherein said nucleic acid encodes an 

aptamer. 

52. The method of Claim 48, wherein said nucleic acid encodes a transcript 
that is complementary to a portion of a microbial genome. 

53. The method of Claim 48, wherein said nucleic acid encodes a molecule 
that inhibits the proliferation of a microbe. 
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54. The method of Claim 48, wherein said at least one operator is xylO. 

55. The method of Claim 54, wherein said fusion promoter is T5. 

56. The method of Claim 54, wherein said fusion promoter further 
comprises a second operator. 

57. The method of Claim 56, wherein said second operator is lacO. 

58. The method of Claim 54, wherein said inducer is xylose. 

59. The method of Claim 54, wherein said fusion promoter is titratable. 

60. The method of Claim 48, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides. Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis. Streptococcus pneumoniae, Streptococcus 
mutatis and any species falling within the genera of any of the above species. 

61. The method of Claim 48, wherein said at least one gram-positive 
organism is Staphylococcus aureus. 

62. The method of Claim 48, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

63. A method for identifying genes involved in cellular proliferation said 
method comprising the steps of: 

(a) introducing into the genome of a cell a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter that is modified to have altered 
activity in at least one gram-positive organism said promoter being linked to at 
least one operator selected from the group consisting of xylO, tetO, trpO, malO 
and AclO> wherein said at least one operator is positioned such that binding of 
at least one repressor to said at least one operator represses transcription from 
said fusion promoter; 
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(b) comparing the proliferation of said cell cultured in the presence of a 
first concentration of an inducer that induces transcription from said fusion 
promoter with the proliferation of said cell cultured in the presence of a 
concentration of said inducer that is less than said first concentration, wherein 
a difference in proliferation indicates that said fusion promoter modulates a 
gene that is required for proliferation; and 

(c) identifying the gene that is modulated by said fusion promoter. 

64. A method for identifying genes involved in cellular proliferation said 
method comprising the steps of: 

(a) introducing into the genome of a cell a construct comprising an 
inducible fusion promoter operably linked to a nucleic acid, said fusion 
promoter comprising at least one promoter selected from the group consisting 
of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least 
one operator selected from the group consisting of xylO, tetO, trpO, malO and 
Ac 10, wherein said at least one operator is positioned such that binding of at 
least one repressor to said at least one operator represses transcription from 
said fusion promoter; 

(b) comparing the proliferation of said cell cultured in the presence of a 
first concentration of an inducer that induces transcription from said fusion 
promoter with the proliferation of said cell cultured in the presence of a 
concentration of said inducer that is less than said first concentration, wherein 
a difference in proliferation indicates that said fusion promoter modulates a 
gene that is required for proliferation; and 

(c) identifying the gene that is modulated by said fusion promoter. 

65. The method of Claim 63, wherein said nucleic acid is random fragment 
of a microbial genome. 

66. The method of Claim 63, wherein said nucleic acid encodes an 
aptamer. 

67. The method of Claim 63, wherein said nucleic acid encodes a transcript 
that is complementary to a portion of a microbial genome. 
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68. The method of Claim 63, wherein said nucleic acid encodes a molecule 
that inhibits the proliferation of a microbe. 

69. The method of Claim 63, wherein said at least one operator is xylO. 

70. The method of Claim 69, wherein said at least one promoter is T5. 

71. The method of Claim 69, wherein said fusion promoter further 
comprises a second operator. 

72. The method of Claim 7 1 wherein said second operator is lacO. 

73. The method of Claim 63, wherein said inducer is xylose. 

74. The method of Claim 63, wherein said fusion promoter is titratabie. 

75. The method of Claim 63, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis t Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

76. The method of Claim 63, wherein said at least one gram-positive 
organism is Staphylococcus aureus. 

77. The method of Claim 63, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

78. A method for identifying a compound which reduces the activity or 
level of a gene product required for proliferation of a cell said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, said fusion promoter comprising at 
least one promoter that is modified to have altered activity in at least one 
gram-positive organism said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and AcJO, 
wherein said at least one operator is positioned such that binding of at least one 
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repressor to said at least one operator represses transcription from said fusion 
promoter; 

(b) sensitizing said cell by inducing transcription from said fusion 
promoter; 

(c) contacting said sensitized cell with a compound; and 

(d) determining the degree to which said compound inhibits 
proliferation of said sensitized cell relative to a cell which has not been 
sensitized. 

79. A method for identifying a compound which reduces the activity or 
level of a gene product required for proliferation of a cell said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, said fusion promoter comprising at 
least one promoter selected from the group consisting of T5, CP25, P32, P59, 
P1P2 and PL, said promoter being linked to at least one operator selected from 
the group consisting of xylO, tetO, trpO, malO and Ac 10, wherein said at least 
one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter; 

(b) sensitizing said cell by inducing transcription from said fusion 
promoter; 

(c) contacting said sensitized cell with a compound; and 

(d) determining the degree to which said compound inhibits 
proliferation of said sensitized cell relative to a cell which has not been 
sensitized. 

80. The method of Claim 78, wherein said at least one operator is xylO. 

8 1 . The method of Claim 80, wherein said at least one promoter is T5 . 

82. The method of Claim 80, wherein said fusion promoter further 
comprises a second operator. 

83. The method of Claim 82, wherein said second operator is lacO. 

84. The method of Claim 78, wherein said inducer is xylose. 
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85. The method of Claim 78, wherein said fiision promoter is titratable. 

86. The method of Claim 78, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 

5 Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 
10 87. The method of Claim 78, wherein said at least one gram-positive 

organism is Staphylococcus aureus. 

88. The method of Claim 78, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

89. A compound identified using the method of Claim 78. 

15 90. A method for inhibiting the activity or expression of a gene in an 

operon required for proliferation said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required operon, said fusion promoter comprising at 

20 least one promoter that is modified to have altered activity in at least one 

gram-positive organism said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and AclO, 
wherein said at least one operator is positioned such that binding of at least one 
repressor to said at least one operator represses transcription from said fusion 

25 promoter; and 

(b) inducing transcription from said fusion promoter. 

91. A method for inhibiting the activity or expression of a gene in an 
operon required for proliferation said method comprising: 

(a) introducing into a cell a construct comprising an inducible fusion 
30 promoter operably linked to a nucleic acid that is complementary to at least a 
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portion of a proliferation-required operon, said fusion promoter comprising at 
least one promoter selected from the group consisting of T5, CP25, P32, P59, 
P1P2 and PL, said promoter being linked to at least one operator selected from 
the group consisting of xylO, tetO, trpO, malO and Xcl0 9 wherein said at least 
5 one operator is positioned such that binding of at least one repressor to said at 

least one operator represses transcription from said fusion promoter; and 
(b) inducing transcription from said fusion promoter. 

92. The method of Claim 90, wherein said at least one operator is xylO. 

93. The method of Claim 92, wherein said at least one promoter is T5. 

10 94. The method of Claim 92, wherein said fusion promoter further 

comprises a second operator. 

95. The method of Claim 94, wherein said second operator is lacO. 

96. The method of Claim 90, wherein said inducer is xylose. 

97. The method of Claim 90, wherein said fusion promoter is titratable. 

15 98. The method of Claim 90, wherein said at least one gram-positive 

organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 
Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
20 tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

99. The method of Claim 90, wherein said at least one gram-positive 
organism is Staphylococcus aureus. 
25 100. The method of Claim 90, wherein said at least one gram-positive 

organism is Enterococcus faecalis. 

101. A method of manufacturing an antibiotic comprising the steps of: 

(a) introducing into a cell a construct comprising an inducible fusion 
promoter operably linked to a nucleic acid that is complementary to at least a 
30 portion of a proliferation-required gene, said fusion promoter comprising at 
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least one promoter that is modified to have altered activity in at least one 
gram-positive organism said promoter being linked to at least one operator 
selected from the group consisting of xylO, tetO, trpO, malO and ZclO, 
wherein said at least one operator is positioned such that binding of at least one 
5 repressor to said at least one operator represses transcription from said fusion 

promoter; 

(b) sensitizing said cell by inducing transcription from said fusion 
promoter; 

(c) contacting said sensitized cell with a compound; 

10 (d) identifying a compound which substantially inhibits the 

proliferation of said sensitized cell relative to a cell which has not been 
sensitized; and 

(e) manufacturing the compound so identified. 
102. A method of manufacturing an antibiotic comprising the steps of: 

15 (a) introducing into a cell a construct comprising an inducible fusion 

promoter operably linked to a nucleic acid that is complementary to at least a 
portion of a proliferation-required gene, said fusion promoter comprising at 
least one promoter selected from the group consisting of T5, CP25, P32, P59, 
P1P2 and PL, said promoter being linked to at least one operator selected from 

20 the group consisting of xylO, tetO, trpO, malO and AclO, wherein said at least 

one operator is positioned such that binding of at least one repressor to said at 
least one operator represses transcription from said fusion promoter; 

(b) sensitizing said cell by inducing transcription from said fusion 
promoter; 

25 (c) contacting said sensitized cell with a compound; 

(d) identifying a compound which substantially inhibits the 
proliferation of said sensitized cell relative to a cell which has not been 
sensitized; and 

(e) manufacturing the compound so identified. . 

30 103. The method of Claim 101, wherein said at least one operator is xylO. 
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104. The method of Claim 103, wherein said at least one promoter is T5. 

105. The method of Claim 103, wherein said fusion promoter further 
comprises a second operator. 

106. The method of Claim 105, wherein said second operator is lacO. 
5 107. The method of Claim 101, wherein said inducer is xylose. 

108. The method of Claim 101, wherein said fusion promoter is titratable. 

109. The method of Claim 101, wherein said at least one gram-positive 
organism is selected from the group consisting of Bacillus anthracis, Clostridium 
botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, 

10 Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, 
Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 
tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus 
mutans and any species falling within the genera of any of the above species. 

15 110. The method of Claim 101, wherein said at least one gram-positive 

organism is Staphylococcus aureus. 

111. The method of Claim 101, wherein said at least one gram-positive 
organism is Enterococcus faecalis. 

1 12. A method for identifying a gene which is required for proliferation of a 
20 prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 
with a regulatable fusion promoter comprising at least one promoter that is 
modified to have altered activity in at least one gram-positive organism said 
25 promoter being linked to at least one operator selected from the group 

I consisting of xylO, tetO, trpO, malO, lacO and AclO, wherein said at least one 

i 

i operator is positioned such that binding of at least one repressor to said at least 

| one operator represses transcription from said fusion promoter, and 
I (b) identifying cells in which the extent of proliferation of said cell 

30 when said fusion promoter is active at a first level is substantially different 
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than the extent of proliferation of said cell when said fusion promoter is active 
at a second level, said firet level being greater than said second level. 

113. A method for identifying a gene which is required for proliferation of a 
prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 
with a regulatabie fusion promoter comprising at least one promoter selected 
from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter 
being linked to at least one operator selected from the group consisting of 
xylO, tetO, trpO, malO, lacO and AcIO, wherein said at least one operator is 
positioned such that binding of at least one repressor to said at least one 
operator represses transcription from said fusion promoter; and 

(b) identifying cells in which the extent of proliferation of said cell 
when said fusion promoter is active at a first level is substantially different 
than the extent of proliferation of said cell when said fusion promoter is active 
at a second level, said first level being greater than said second level. 

1 14. The method of Claim 1 12, wherein said fusion promoter is an inducible 
promoter. 

115. The method of Claim 112, wherein the step of replacing said native 
promoter comprises introducing a linear nucleic acid comprising a 5' portion 
homologous to a first portion of said native promoter, a 3' portion homologous to a 
second portion of said native promoter and said fusion promoter disposed between 
said 5' portion and said 3' portion into said cell such that homologous recombination 
occurs between said 5' portion and said first portion of said native promoter and 
between said 3' portion and said second portion of said native promoter. 

116. The method of Claim 115, wherein said linear nucleic acid is double 
stranded. 

117. The method of Claim 115, wherein said linear nucleic acid is single 
stranded. 
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118. A method for identifying a compound which inhibits the proliferation 
of a prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 
with a regulatable fusion promoter comprising at least one promoter that is 
modified to have altered activity in at least one gram-positive organism said 
promoter being linked to at least one operator selected from the group 
consisting of xylO, tetO, trpO, malO. lacO and AcJO, wherein said at least one 
operator is positioned such that binding of at least one repressor to said at least 
one operator represses transcription from said fusion promoter; and 

(b) comparing the extent of proliferation of a first sample of said cell in 
the presence of said compound to the extent of proliferation of a second 
sample of said cell in the presence of said compound, wherein said first sample 
of said cell has a reduced activity of said fusion promoter activity relative the 
activity of said fusion promoter in said second sample of said cell and wherein 
said compound inhibits the proliferation of said cell if the extent of 
proliferation of said first sample of said cell is substantially less than the extent 
of proliferation of said second sample of said cell. 

119. A method for identifying a compound which inhibits the proliferation 
of a prokaryotic cell said method comprising the steps of: 

(a) replacing the native promoter of a gene in the chromosome of a 
prokaryotic cell having an enhanced frequency of homologous recombination 
with a regulatable fusion promoter comprising at least one promoter selected 
from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter 
being linked to at least one operator selected from the group consisting of 
xylO, tetO, trpO t malO, lacO and Ac 10, wherein said at least one operator is 
positioned such that binding of at least one repressor to said at least one 
operator represses transcription from said fusion promoter, and 

(b) comparing the extent of proliferation of a first sample of said cell in 
the presence of said compound to the extent of proliferation of a second 
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sample of said cell in the presence of said compound, wherein said first sample 
of said cell has a reduced activity of said fusion promoter activity relative the 
activity of said fusion promoter in said second sample of said cell and wherein 
said compound inhibits the proliferation of said cell if the extent of 
proliferation of said first sample of said cell is substantially less than the extent 
of proliferation of said second sample of said cell.. 

120. The method of Claim 118, wherein the step of replacing said native 
promoter comprises introducing a linear nucleic acid comprising a 5' portion 
homologous to a first portion of said native promoter, a 3* portion homologous to a 
second portion of said native promoter and said fusion promoter disposed between 
said 5' portion and said 3' portion into said cell such that homologous recombination 
occurs between said 5' portion and said first portion of said native promoter and 
between said 3' portion and said second portion of said native promoter. 

121. The method of Claim 120, wherein said linear nucleic acid is double 
stranded. 

122. The method of Claim 120, wherein said linear nucleic acid is single 
stranded. 

123. A method for identifying a gene which is required for proliferation of a 
prokaryotic cell said method comprising the steps of: 

(a) introducing at least one operator selected from the group consisting 
of xy/0, tetO, trpO, malO, lacO and AclO into a prokaryotic cell having an 
enhanced frequency of homologous recombination such that said at least one 
operator regulates transcription of a target nucleic acid in the chromosome of 
said cell; and 

(b) identifying cells in which the extent of proliferation of said cell 
when said target nucleic acid is transcribed at a first level is substantially 
different than the extent of proliferation of said cell when said target nucleic 
acid is transcribed at a second level, said first level being greater than said 
second level. 
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124. The method of Claim 123, wherein the step of introducing said at least 
one operator comprises introducing a linear nucleic acid comprising a 5' portion 
homologous to a first portion of the chromosome of said cell, a 3' portion homologous 
to a second portion of the chromosome of said cell and said at least one operator 
disposed between said 5' portion and said 3' portion into said cell such that 
homologous recombination occurs between the 5* portion and said first portion of the 
chromosome of said cell and between the 3' portion and said second portion of the 
chromosome of said cell. 

125. The method of Claim 124, wherein said linear nucleic acid is double 

stranded. 

126. The method of Claim 124, wherein said linear nucleic acid is single 
stranded. 

127. A method of identifying a compound which inhibits the proliferation of 
a prokaryotic celt said method comprising the steps of: 

(a) obtaining a prokaryotic cell in which transcription of a nucleic acid 
required for proliferation of said cell is regulated by at least one operator 
which has been introduced into the chromosome of said cell said at least one 
operator selected from the group consisting of xylO. tetO, trpO, malO, lacO 
and hclO\ 

(b) sensitizing said cell by growing said cell under conditions in which 
the level of transcription of said gene is lower than that of a wild type cell; 

(c) contacting said sensitized cell with said compound; and 

(d) determining the degree to which said compound inhibits the growth 
of said sensitized cell relative to an unsensitized cell. 

128. The method of Claim 127, wherein said cell has an enhanced frequency 
of homologous recombination. 

129. The method of Claim 128, wherein said linear nucleic acid is single 
stranded. 

130. The method of Claim 128, wherein said linear nucleic acid is double 
stranded. 
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131. A method of identilying a nucleic acid sequence having promoter 
activity in Enterococcus faecalis said method comprising the steps of: 

(a) inserting a candidate nucleic acid into a vector comprising lacL- 
lacM reporter genes such that said candidate nucleic acid is upstream of the 

5 lacL-lacM reporter genes; 

(b) introducing said vector comprising said candidate nucleic acid into 
Enterococcus faecalis; 

(c) detecting expression of the lacL-lacM reporter genes, wherein 
expression of said lacLAacM reporter genes indicates that said candidate 

10 nucleic acid sequence has promoter activity. 

132. The method of Claim 131, wherein said detecting step comprises 
measuring P-galactosidase activity. 

133. The method of Claim 131, wherein said candidate nucleic acid is a 
promoter modified to increase activity in a gram-positive organism. 

15 134. The method of Claim 131, wherein said lacL-lacM reporter gene is 

derived from Leuconostoc mesenteroides. 

135. The method of Claim 131, wherein said vector is pEPEFl. 



20 
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5 6 
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6 

GCCTGATACAGATTAAATCAGAACGCAGAAGCGGTCTGATAAAACAGAATTTG 

CCTGGCGGCAGTAGCGCGGTGGTCCCACCTGACCCCATGCCGAACTCAGAAGT 

GAAACGCCGTAGCGCCGATGGTAGTGTGGGGTCTCCCCATGCGAGAGTAGGGA 

ArTnrrAr.nrATrA AATAAAACGAAAGGCTCAGTCG AAAGACTGGGCCTTTCG 

7A 

TTTTAT CTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGCCGGG 
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CCTTTTTGCGTTTCTACAAACTC 
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SEQUENCE LISTING 

<110> ELITRA PHARMACEUTICALS , INC. 
KASELBECK, Robert 
WALL, Daniel 
GROSS, Molly 

<120> BACTERIAL PROMOTERS AND METHODS OF USE 



<130> ELITRA. 010VPC 



<150> US 60/259,434 
<151> 2000-12-27 



<150> US 09/948,993 
<151> 2001-09-06 



<160> 68 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 556 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Sequence map showing the Xyl-T5 fusion promoter 
and associated DNA sequences 



<400> 1 

gaccgtcata aaaaatttat ttgctttcag gaaaattttt ctgtataata gattcaagtt 60 

agtttgttta ttaaattaac caactaaaat gtagaattcc gagctcgagc cggtacccgg 120 

ggatcctcta gagtcgacct gcagccaagc ttgggctttt cagcctgata cagattaaat 180 

cagaacgcag aagcggtctg ataaaacaga atttgcctgg cggcagtagc gcggtggtcc 240 

cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtggggt 300 

ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa 360 

gactgggcct ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat 420 

ccgccgggag cggatttgaa cgttgcgaag caacggcccg gagggtggcg ggcaggacgc 480 

ccgccataaa ctgccaggca tcaaattaag cagaaggcca tcctgacgga tggccttttt 54 0 
gcgtttctac aaactc 556 

<210> 2 
<211> 168 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> CP25 fusion promoter 



<400> 2 

attaaatata aataacattc aaatgaatat 
tagatatctc ggaccgacat taaattcttg 
tagttgtctc gagagttagt ttgtttatta 



taaataatta cagcagtctg agttataaaa 60 
acagggagag ataggtttga tagaatataa 120 
aattaaccaa ctggacga 168 



<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR primer 
<400> 3 

tggcattcta cgactataac 20 

<210> 4 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 4 

gcgaattcta cattttagtt ggttaattt 29 

<210> 5 
<211> 94 
<212> DNA. 

<213> Escherichia coli 
<220> 

<223> Oligonucleotide Xyl-T5 
<400> 5 

gaccgtcata aaaaatttat ttgctttcag gaaaattttt ctgtataata gattcaagtt 60 
agtttgttta ttaaattaac caactaaaat gtag 94 

<210> 6 
<211> 95 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Xyl-T5 complement 
<400> 6 

aattctacat tttagttggt taatttaata aacaaactaa cttgaatcta ttatacagaa 60 
aaattttcct gaaagcaaat aaatttttta tgacg 95 

<210> 7 
<211> 10929 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pEP25 
<400> 7 

aagcttgcct gcaggtcgac gataaccaca taacagtcat aaaaccactc ctttttaaca 60 
aactttatca caagaaatat tttggcattc tacgactata acttaaattt atatttttta 120 
ctttataata tataattgat tatagaataa tgttgctcat atcgtttgcc aacatctagt 180 
actcaaatta cactatgtta cacttggtaa tattaaccga acttcccctg tccaaattag 240 
ataagaggta ataataaatg gaaaataatt ttatagtaaa tgaaaatgag aagcgtgtat 300 
taaaacaaat tttcaataac agcaatattt cacgaacaca aatatcgaag aatttagaac 360 
ttaataaagc tactatttct aacattctga acaacttaaa acacaagagt ttagttaatg 420 
aagtaggaga aggtaatagt actaaaagtg gtggacgaaa gcctatttta ctcgaaatta 480 
accaaaaata tggctactat atttctatgg atttaacata tgattccgtt gaattaatgt 540 
acaactactt tgatgctact atattaaagc aagattccta cgaattaaat gataaaaatg 600 
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taagcagtat attacaaatt ttaaaatcta 
tatatgggtt acttggtata tctatatcca 
taatcaatct tccttttcat aaaaatgaga 
tcacaaatgt tcctgtcgtt atagaaaatg 
gtttatatat taattcaaac ataaataatt 
gcgctggcat cataataaat aaaaaacttt 
taggtaagac attggttttg gaatctataa 
aagatatatg ctcccaagac gctttaatac 
tgacgtttac agaactaatc caatattaca 
ttaaacaatt tattaataaa atgacagttc 
cagacgctat ttatattaac tgtcctttaa 
ttaaagagca attctcctgt ttttctcaag 
atgtaaaaca agctacttta ttgggtggca 
taaataacat tcaaatgaat attaaataat 
tcggaccgct ttggcagttt attcttgaca 
ctcgagagtt agtttgttta ttaaattaac 
tgattacgga ttcactggcc gtcgttttac 
cccaacttaa tcgccttgca gcacatcccc 
cccgcaccga tcgcccttcc caacagttgc 
ggtttccggc accagaagcg gtgccggaaa 
atactgtcgt cgtcccctca aactggcaga 
acgtgaccta tcccattacg gtcaatccgc 
gttactcgct cacatttaat gttgatgaaa 
tttttgatgg cgttaactcg gcgtttcatc 
gccaggacag tcgtttgccg tctgaatttg 
accgcctcgc ggtgatggtg ctgcgctgga 
tgtggcggat gagcggcatt ttccgtgacg 
tcagcgattt ccatgttgcc actcgcttta 
ctgaagttca gatgtgcggc gagttgcgtg 
agggtgaaac gcaggtcgcc agcggcaccg 
gtggtggtta tgccgatcgc gtcacactac 
gcgccgaaat cccgaatctc tatcgtgcgg 
tgattgaagc agaagcctgc gatgtcggtt 
tgctgctgaa cggcaagccg ttgctgattc 
tgcatggtca ggtcatggat gagcagacga 
acaactttaa cgccgtgcgc tgttcgcatt 
gcgaccgcta cggcctgtat gtggtggatg 
caatgaatcg tctgaccgat gatccgcgct 
gaatggtgca gcgcgatcgt aatcacccga 
caggccacgg cgctaatcac gacgcgctgt 
gcccggtgca gtatgaaggc ggcggagccg 
tgtacgcgcg cgtggatgaa gaccagccct 
aatggctttc gctacctgga gagacgcgcc 
tgggtaacag tcttggcggt ttcgctaaat 
tacagggcgg cttcgtctgg gactgggtgg 
gcaacccgtg gtcggcttac ggcggtgatt 
gtatgaacgg tctggtcttt gccgaccgca 
accagcagca gtttttccag ttccgtttat 
acctgttccg tcatagcgat aacgagctcc 
cgctggcaag cggtgaagtg cctctggatg 
tgcctgaact accgcagccg gagagcgccg 
aaccgaacgc gaccgcatgg tcagaagccg 
tggcggaaaa cctcagtgtg acgctccccg 
ccagcgaaat ggatttttgc atcgagctgg 
caggctttct ttcacagatg tggattggcg 
atcagttcac ccgtgcaccg ctggataacg 
accctaacgc ctgggtcgaa cgctggaagg 
tgttgcagtg cacggcagat acacttgctg 
ggcagcatca ggggaaaacc ttatttatca 
gtcaaatggc gattaccgtt gatgttgaag 
ttggcctgaa ctgccagctg gcgcaggtag 



atataaacgt ctcagaaaaa tatgatacgt 660 
tacacggtat cgttgacgat gagcaaaaca 720 
aacgcacatt taccgatgaa ttaaagtcat 780 
aagcaaattt atcagcgcta tatgaaaaaa 84 0 
tgattacttt aagtattcac aagggtatag 900 
atcgtggctc aaatggagag gctggagaga 960 
ataacaatga caacaaatat tataaaatcg 1020 
agaaaataaa taataggttg ggcgtcacat 1080 
acgaaggaaa ttcaattgtt gctcatgaaa 1140 
tgattcataa tttgaataca caatttaacc 1200 
ttaatgaatt accaaatatt ttaaatgaaa 1260 
gcagtccaat tcaattacat ttaactacta 1320 
ctttagcaat aatgcaaaaa acattaaata 1380 
tacagcagtc tgagttataa aatagatatc 1440 
tgtagtgagg gggctggtat aatcacatag 1500 
caactggatc cggaggaaca gctatgacca 1560 
aacgtcgtga ctgggaaaac cctggcgtta 1620 
ctttcgccag ctggcgtaat agcgaagagg 1680 
gcagcctgaa tggcgaatgg cgctttgcct 17 40 
gctggctgga gtgcgatctt cctgaggccg 1800 
tgcacggtta cgatgcgccc atctacacca 1860 
cgtttgttcc cacggagaat ccgacgggtt 1920 
gctggctaca ggaaggccag acgcgaatta 1980 
tgtggtgcaa cgggcgctgg gtcggttacg 204 0 
acctgagcgc atttttacgc gccggagaaa 2100 
gtgacggcag ttatctggaa gatcaggata 2160 
tctcgttgct gcataaaccg actacacaaa 2220 
atgatgattt cagccgcgct gtactggagg 2280 
actacctacg ggtaacagtt tctttatggc 2340 
cgcctttcgg cggtgaaatt atcgatgagc 2400 
gtctgaacgt cgaaaacccg aaactgtgga 2460 
tggttgaact gcacaccgcc gacggcacgc 2520 
tccgcgaggt gcggattgaa aatggtctgc 2580 
gaggcgttaa ccgtcacgag catcatcctc 2640 
tggtgcagga tatcctgctg atgaagcaga 2700 
atccgaacca tccgctgtgg tacacgctgt 2760 
aagccaatat tgaaacccac ggcatggtgc 2820 
ggctaccggc gatgagcgaa cgcgtaacgc 2880 
gtgtgatcat ctggtcgctg gggaatgaat 2940 
atcgctggat caaatctgtc gatccttccc 3000 
acaccacggc caccgatatt atttgcccga 3060 
tcccggctgt gccgaaatgg tccatcaaaa 3120 
cgctgatcct ttgcgaatac gcccacgcga 3180 
actggcaggc gtttcgtcag tatccccgtt 3240 
atcagtcgct gattaaatat gatgaaaacg 3300 
ttggcgatac gccgaacgat cgccagttct 3360 
cgccgcatcc agcgctgacg gaagcaaaac 34 20 
ccgggcaaac catcgaagtg accagcgaat 34 80 
tgcactggat ggtggcgctg gatggtaagc 3540 
tcgctccaca aggtaaacag ttgattgaac 3600 
ggcaactctg gctcacagta cgcgtagtgc 3660 
ggcacatcag cgcctggcag cagtggcgtc 3720 
ccgcgtccca cgccatcccg catctgacca 3780 
gtaataagcg ttggcaattt aaccgccagt 3840 
ataaaaaaca actgctgacg ccgctgcgcg 3900 
acattggcgt aagtgaagcg acccgcattg 3960 
cggcgggcca ttaccaggcc gaagcagcgt 4 020 
atgcggtgct gattacgacc gctcacgcgt 4080 
gccggaaaac ctaccggatt gatggtagtg 4140 
tggcgagcga tacaccgcat ccggcgcgga 4200 
cagagcgggt aaactggctc ggattagggc 4260 
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cgcaagaaaa ctatcccgac 
tgtcagacat gtataccccg 
gcgaattgaa ttatggccca 
acagtcaaca gcaactgatg 
catggctgaa tatcgacggt 
cagtatcggc ggaattccag 
aaaaataata atgtcgaggg 
tgggaatttg taccccttat 
ccttggaagc tgtcagtagt 
actttccaaa tttacaaaag 
ctattaaaaa tagacaatac 
tgtttcattg cttgatgaaa 
cattttgaaa caaagtacgt 
cgggtaagtt ttattaagac 
agcttaagca attgctgaat 
atatccaagg tacgcttgta 
ctttcaaaaa ccactttttt 
attttatacc tctgtttgtt 
acacgagtat tcagttttaa 
atagacgtta cctgtttact 
ccaatttcgt aaacggtatc 
actttttcac tcgttaaaaa 
cttcttaatt acaaattttt 
aagatactgc actatcaaca 
ttttacgttt ccgccattct 
aacacgaacc gtcttatctc 
tttttctttt tgtgcgccta 
tcttatttta tcatcaacaa 
aatatataaa agccctcttt 
atatatccct tttctccgat 
attcctttta ccagggtttc 
tttttctttc taaaatgttc 
taaaacgatt tttttaatac 
ttttctgctt ctttttcttt 
ttaggtacag cttccagtaa 
tgattgcctt ctcttttacc 
gcactcgtta aaaggtgctg 
acaacaagct ctgatttcac 
ctccttttta ggtggcacaa 
cttaagttat tggtatgact 
atgtatcgtt agaaaaccga 
ccagtcatta ggcctatctg 
accatcacaa acagaatgat 
attaatgaat tttcctgctg 
taagttaaac ccagtaaatg 
tataggtgtt ttgggaaaca 
taaatcataa aactctttga 
agatacacca tcaaaaattg 
gtcgctattg taaccagttc 
aataaatgca gggtaaaatt 
atcaatttct gtgggtatac 
ttttctcttc caattgtcta 
ttttatctaa agtgaattta 
tttttaaaag tcaatattat 
caattttcgt ttgttgaact 
aatgtggtct tttgtgtttt 
tatcttgata ataagggtaa 
cccctttcat tttctaatgt 
aatccttttt cttattacgc 
cgccatgaca gccatgataa 
tgattaacaa ttattagagg 



cgccttactg ccgcctgttt 
tacgtcttcc cgagcgaaaa 
caccagtggc gcggcgactt 
gaaaccagcc atcgccatct 
ttccatatgg ggattggtgg 
ctgagcgccg gtcgctacca 
taccgagctc gaattccttt 
cgatacaaat tccccgtagg 
atacctaata atttatctac 
cgactcatag aattatttcc 
ttgctcataa gtaacggtac 
ctgattttta gtaaacagtt 
atatagcttc caatatttat 
actgtttact tttggtttag 
cgagacttga gtgtgcaaga 
gaatccttct tcaacaatca 
aataatttgt gtgcttaaat 
agggaattga aactgtagaa 
tttttctgac gataagttga 
tattttagcc agtttcgtcg 
ggtttctttt aaattcaatt 
gttttgagaa tattttatat 
agcatctaat ttaacttcaa 
cactcttaag tttgcttcta 
ttgctgtttc gatttttatg 
ccattatatc tttttttgca 
aatttcccac aatcactcac 
tcacaaatca cttgtgattt 
aaagggcttt tatgtttatt 
ttttacaacg gcattgtagg 
ttttccctct agtaattctt 
agctaaagta aacttacatt 
aatattgttg ccacacttag 
ctgttcctgg taatcagaaa 
atgaacaatg aattttttga 
gatttttttt aaatacgttt 
actttcaact gcctggcaca 
tttttgcatt ctacaaactg 
atgtgaggca ttttcgctct 
ggttttaagc gcaaaaaaag 
ctgtaaaaag tacagtcggc 
acaattcctg aatagagttc 
gtacctgtaa agatagcggt 
taataatggg tagaaggtaa 
aagtccatgg aataatagaa 
atttccccga accattatat 
agtcattctt tacaggagtc 
tataaagtgg ctctaactta 
taaaagctgt atttgagttt 
tatatccttc ttgttttatg 
taaaagtcgt ttggtggttc 
aatcaatttt attaaagttc 
ggaggcttac ttgtctgctt 
tgtaacataa atatatattt 
aatgggtgct ttagttgaag 
tttaaaggat ttgagcgtag 
ctattgccgg gatagactgt 
aaatctatta ccttattatt 
aaaatggccc gatttaagca 
ttactaatac taggagaagt 
tcatcgttca aaatggtatg 



tgaccgctgg gatctgccat 4320 
cggtctgcgc tgcgggacgc 4380 
ccagttcaac atcagccgct 44 40 
gctgcacgcg gaagaaggca 4500 
cgacgactcc tggagcccgt 4560 
ttaccagttg gtctggtgtc 4620 
tagtccagcc gagcgcttag 4 680 
cgctagggac ctctttagct 4740 
attcccttaa gtaacgtgta 4 800 
tcccgttaaa taatagataa 4860 
ttaaattgtt tactttggcg 4920 
gacgatattc tcgattgacc 4 980 
ctggaacatc tgtggtatgg 5040 
gatgaaagca ttccgctggc 5100 
gcaaccctag tgttcggtga 5160 
gatagatgtc agacgcatgg 5220 
ggtaaggaat actcccaaca 5280 
tatcttggtg aattaaagtg 5340 
atagatgact gtctaattca 5400 
ttaaatgccc tttacctgtt 54 60 
gttttattat ttggttgagt 5520 
ttttgttcat gtaatcactc 5580 
ttcctattat acaaaatttt 5640 
agtcttattt ccataacttc 5700 
atatggtgca agtcagcacg 5760 
ctgattggtg tatcatttcg 5820 
ttctttctat ttcttcttat 5880 
gtgataagtg atttgtgatt 5940 
ttgagaaaga tataaaatca 6000 
actttctatc tttcgttttg 6060 
ttacatttgt tttggtgagt 6120 
caggataatt tgaacaacca 6180 
gacattttcc tacaatactt 6240 
aatttagttt ttctatatcg 6300 
tattcgtaat aaagttctct 6360 
cccatttagc cgtcatttca 6420 
ataattttcc tttttcagtt 6480 
cataactcat atgtaaatcg 6540 
ttccggcgag gctagttacc 6600 
ttgctttttc gtacctatta 6660 
attatctcat attataaaag 6720 
ataaacaatc ctgcatgata 6780 
aaatatattg aattaccttt 6840 
ttactattat tattgatatt 6900 
agagaaaaag cattttcagg 6960 
ttctctacat cagaaaggta 7020 
caaataccag agaatgtttt 7080 
tcccaataac ctaactctcc 7140 
atcacccttg tcactaagaa 7200 
tttcggtata aaacactaat 7260 
aaataatgat taaatatctc 7320 
atttgatatg cctcctaaat 7380 
tcttcattag aatcaatcct 7440 
taaaaatatc ccactttatc 7500 
aataaaagac cacattaaaa 7560 
cgaaaaatcc ttttctttct 7620 
aacattctca cgcataaaat 7680 
aattcaattc gctcataatt 7740 
caccctttat tccgttaatg 7800 
taataaatac gtaaccaaca 7860 
cgttttgaca catccactat 7920 
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atatccgtgt cgttctgtcc actcctgaat 
aagtttctca gagtcggaaa gttgaccaga 
tgaaggaaga tctgattgct taactgcttc 
gatgcgatga tgcagaccaa tcaacatggc 
tggtagaaat gttgtcggtc cttgcacacg 
cagctcttct acgataaggg cacaaatcgc 
agcagtttga tacactttct ctaagtatcc 
aaattgacca tgtgtaagcg gccaatctga 
ctcttcgcta tcaaaattca cttccacctt 
ctctgcttcc tctgttgaca tgacacacat 
ctgacgacca agagagccat aaacaccaat 
tattcgttcc ttaatttcat gaacaatctt 
tccattcact attctcattc ccttttcaga 
tatttggaga gcaccgttct tattcagcta 
atccttttaa taacaattat agcatctaat 
ctctttaata aaataatttt tccgttccca 
ttcatcggct ttttcgtcat catctgtatg 
gtttaatttt ttatgtattt cttttaacaa 
taaaccttcc tccaaatcag acaaacgttt 
atccgtatcc tttacaggat attttgcagt 
atatttattt ttcggtcgat tttttattaa 
agcgtttatt tcgtttagtt atcggcataa 
agcccttgag cgtagcgtgg ctttgcagcg 
atgactgaat gaaataataa gcgcagcgtc 
agtttgaggg aatgaaattc cctcatgggt 
gagcggtagc gctggaaaat ttttgaaaaa 
aaggaagcga attttgcttc cgtactacga 
gtgccaaaaa cgctctatcc caactggctc 
gaatcgccaa cgttttcgcc aacgtttttt 
gtttttatga ttacaaagtg atacactaat 
aatggtgatt tcagaatcga aaaaaagagt 
aaaattaaca gatatggcga aacaaaaagg 
gctatagaag aatatgcaag aaaggaatca 
tttagaagga tacgagtttt cgctacttgt 
aaaaatacta aagctagaaa ttttggattt 
tggaaagaaa aattagagag tttgggcgta 
gacgaaaaaa aagataaaga tacatggaat 
tataaaaaac cacactatca cgttatatat 
gttaggaaca agattaagcg aaaattgggg 
gattatatca aaggttcata tgaatatttg 
aataaacata tatacgacaa aaaagatatt 
tatataacac ttgatgaaag ccaaaaaaga 
gatgactata atttggtaaa tacaaaagat 
gagtttggaa ttttaaatac gaatgatgta 
tttagattat ggtttgaggg caattatcag 
cttgatgctg aaacggggga aataaaatga 
atgaggaatt aaaaaaagaa attaaggact 
tggaagttga attaagtaca acaatagatt 
aagcccccct gacgaaagtc gaagggggct 
caatacgatt gcaataaaca aaatgatcga 
taaaacaac 



cccattccag aaattctcta gcgattccag 7980 
cattacgaac tggcacagat ggtcataacc 8040 
agttaagacc gaagcgctcg tcgtataaca 8100 
acctgccatt gctacctgta cagtcaagga 8160 
aatattacgc catttgcctg catattcaaa 8220 
atcgtggaac gtttgggctt ctaccgattt 8280 
acctgaatca taaatcggca aaatagagaa 8340 
ttccacctga gatgcataat ctagtagaat 84 00 
ccactcaccg gttgtccatt catggctgaa 84 60 
catctcaata tccgaatagg gcccatcagt 8520 
agccttaaca tcatccccat atttatccaa 8580 
cattctttct tctctagtca ttattattgg 8640 
taattttaga tttgcttttc taaataagaa 8700 
ttaataactc gtcttcctaa gcatccttca 8760 
cttcaacaaa ctggcccgtt tgttgaacta 8820 
attccacatt gcaataatag aaaatccatc 8880 
aatcaaatcg ccttcttctg tgtcatcaag 8940 
accaccatag gagattaacc ttttacggtg 9000 
caaattcttt tcttcatcat cggtcataaa 9060 
ttcgtcaatt gccgattgta tatccgattt 9120 
aacgtctcaa aatcgtttct gagacgtttt 9180 
tcgttaaaac aggcgttatc gtagcgtaaa 9240 
aagatgttgt ctgttagatt atgaaagccg 9300 
cttctatttc ggttggagga ggctcaaggg 9360 
ttgattttaa aaattgcttg caattttgcc 9420 
aatttggaat ttggaaaaaa atggggggga 94 80 
ccccccatta agtgccgagt gccaattttt 9540 
aagggtttga ggggtttttc aatcgccaac 9600 
ataaatctat atttaagtag ctttattgtt 9660 
tttataaaat tatttgattg gagtttttta 9720 
tatgatttct ctgacaaaag agcaagataa 9780 
tttttcaaaa tctgcgggtt gcgggcgtta 9840 
gaacaaaaaa aataagcgaa agctcgcgtt 9900 
ttttgataag gtaattatat catggctatt 9960 
ttattatatc ctgactcaat tcctaatgat 10020 
tctatggctg tcagtccttt acacgatatg 10080 
agtagtgatg ttatacgaaa tggaaagcac 10140 
attgcacgaa atcctgtaac aatagaaagc 10200 
aatagttcag ttgctcatgt tgagatactt 10260 
actcatgaat caaaggacgc tattgctaag 10320 
ttgaacatta atgattttga tattgaccgc 10380 
gaattgaaga atttactttt agatatagtg 10440 
ttaatggctt ttattcgcct taggggagcg 10500 
aaagatattg tttcaacaaa ctctagcgcc 10560 
tgtggatata gagcaagtta tgcaaaggtt 10620 
caaacaaaga aaaagagtta tttgctgaaa 10680 
taaaagagcg tattgaaaga tacagagaaa 10740 
tattgagagg agggattatt gaataaataa 10800 
tttattttgg tttgatgttg cgattaatag 10860 
tgaaagcaat gtgagagaaa aatatgttcc 10920 

10929 



<210> 8 
<211> 12739 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pEPEFl 
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<400> 8 

aagcttgcct gcaggcgacg ataaccacat 
actttatcac aagaaatatt ttggcattct 
tttataatat ataattgatt atagaataat 
ctcaaattac actatgttac acttggtaat 
taagaggtaa taataaatgg aaaataattt 
aaaacaaatt ttcaataaca gcaatatttc 
taataaagct actatttcta acattctgaa 
agtaggagaa ggtaatagta ctaaaagtgg 
ccaaaaatat ggctactata tttctatgga 
caactacttt gatgctacta tattaaagca 
aagcagtata ttacaaattt taaaatctaa 
atatgggtta cttggtatat ctatatccat 
aatcaatctt ccttttcata aaaatgagaa 
cacaaatgtt cctgtcgtta tagaaaatga 
tttatatatt aattcaaaca taaataattt 
cgctggcatc ataataaata aaaaacttta 
aggtaagaca ttggttttgg aatctataaa 
agatatatgc tcccaagacg ctttaataca 
gacgtttaca gaactaatcc aatattacaa 
taaacaattt attaataaaa tgacagttct 
agacgctatt tatattaact gtcctttaat 
taaagagcaa ttctcctgtt tttctcaagg 
tgtaaaacaa gctactttat tgggtggcac 
aaataacatt caaatgaata ttaaataatt 
cggaccgctt tggcagttta ttcttgacat 
actgttctcg agagttagtt tgtttattaa 
gggtaacttt gaaaggatat tcctcatgca 
agtcttccgg gtcaaccaat tacctgcaca 
agaattcaaa acgggtagtc gcttcatcaa 
cgccaagaca ccggctgaac gcccagttga 
ctttgatacg attcaagttc ccggtcatat 
cattaacacg ctatacccat gggaaggtaa 
tcaagatcaa ttaacaccag gcctattcag 
cctcaaaacc ttcgatctcg acgatgtttt 
aggggtagaa gaagccctgt acgtctggtt 
tagtttcacc ccttcagaat ttgatttgac 
agcggttcgg gtctacaaac acagtactgc 
tttctctggt attttccgtg acgtcaatat 
tttggacatc cgaccagttc caaatgccaa 
taaagtaacc ggcgaaccag ccactttagc 
actgacgagt caaacgcaaa ccggtagtgg 
ccaactgcac ttgtggtcac cacaaacgcc 
cgatgctgat caccaactct tggaagtcgt 
gctgcgcgat gacaaagtca tttacgtcaa 
ccggcacgaa tggaacgccc acaccggtcg 
tatccaaacc atgttagcta acaatatcaa 
attaccttgg tatcaattat gtgacgaggc 
cgaatcgcac gggtcatggc aaaagatggg 
cgataatcca cactggccag cagcggtgat 
taaaaaccac ccatcaatca ttttttggtc 
tatcgcggcg atgcaggctt tttataaaga 
aggcgttttc tacacaccag aattaaaaga 
cgaaaagccc caaaatattg tagcttactt 
ttgtgaatat atgcatgaca tggggaattc 
gatcgacaag tatccaatgt atcaaggtgg 
cttcgttcat gacccaatca ccgaccaaga 
acgccactcc gattatgcat tctccggtaa 
accagcaatg caagaggtga aatattatta 
tttacggcga cgggagctta ggactacagg 



aacagtcata aaaccactcc tttttaacaa 60 
acgactataa cttaaattta tattttttac 120 
gttgctcata tcgtttgcca acatctagta 180 
attaaccgaa cttcccctgt ccaaattaga 240 
tatagtaaat gaaaatgaga agcgtgtatt 300 
acgaacacaa atatcgaaga atttagaact 360 
caacttaaaa cacaagagtt tagttaatga 420 
tggacgaaag cctattttac tcgaaattaa 480 
tttaacatat gattccgttg aattaatgta 540 
agattcctac gaattaaatg ataaaaatgt 600 
tataaacgtc tcagaaaaat atgatacgtt 660 
acacggtatc gttgacgatg agcaaaacat 720 
acgcacattt accgatgaat taaagtcatt 780 
agcaaattta tcagcgctat atgaaaaaag 840 
gattacttta agtattcaca agggtatagg 900 
tcgtggctca aatggagagg ctggagagat 960 
taacaatgac aacaaatatt ataaaatcga 1020 
gaaaataaat aataggttgg gcgtcacatt 1080 
cgaaggaaat tcaattgttg ctcatgaaat 1140 
gattcataat ttgaatacac aatttaaccc 1200 
taatgaatta ccaaatattt taaatgaaat 12 60 
cagtccaatt caattacatt taactactaa 1320 
tttagcaata atgcaaaaaa cattaaatat 1380 
acagcagtct gagttataaa atagatatct 14 40 
gtagtgaggg ggctggtata atcacatagt 1500 
attaaccaac tggatcccgg gtctagatta 1560 
agctaatctt caatggttag atgacccaga 1620 
tagtgatcac cattattatc acgacacagc 1680 
gagtctcaat ggcgcttggc gttttaactt 1740 
tttttatcaa cccgatttcg atgcaaccga 1800 
tgaactagcc ggctatggtc aaattcaata 18 60 
aatttaccgt cgcccaccgt ataccctcaa 1920 
cgacgctgcg gacaacaccg tcggctcgta 1980 
taaagggcaa cgtattatca ttcagttcca 2040 
aaatggccat tttattggct actctgaaga 2100 
gccgtatatt caggaccaag gtaacgtttt 2160 
tgcctttatt gaagaccaag atatgttccg 2220 
actggcggag cctgctagcc atattactga 2280 
tctcaaaagt ggtgagctca acatcactac 2340 
gctgaccgtt aaagaccatg acgggcgagt 24 00 
cagtgtaacc tttgatacta tgttattcga 24 60 
gtatctctat caattgacaa ttgaagttta 2520 
cccatatcag tttgggttcc ggacggtcga 2580 
caataaacgg ttggtgatca acggggttaa 2640 
cgttatcagt atggctgata tgcgcgctga 2700 
tgccgatcgg acctgccatt atcctgatca 27 60 
cggtatctac ctaatggccg aaaccaacct 2820 
ggctatcgag ccttcttaca atgttcctgg 2880 
cgaccgggcc cgttcaaact acgaatggtt 2940 
acttggcaat gaatcgtatg ctggcgaaga 3000 
acacgatgat tcacgactcg tccactacga 3060 
tcgcatttct gatgttgaaa gtcggatgta 3120 
ggaagataac ccaaccaaac ctttcctaaa 3180 
tctgggcggt atgcaatcat ataatgattt 3240 
ctttatttgg gactttattg atcaagcact 3300 
cgtgctccgg tatggcggtg atttcgacga 3360 
cggcttaatg tttgccgacc ggacaccaaa 3420 
tggcttacac aaataatcaa ctacacgtta 34 80 
gggctaattt ccactacctc tttagctacg 3540 
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aacgtggggg acttgaatca ctcgtcgtca 
cgcccatgtt ttggcgggcg acaaccgata 
ccgcacagtg gtacgcggcc gataagttct 
acgaccaacc agtcacaccg ttaccaatcg 
aaatcgccac gaaagtctct ctggcttacc 
tcgtcacagt gacttatacg gtgacagcag 
gcggtcagtc tgatttgcca gagctacccg 
ccgcgaccgg cttcgactat accggtttgt 
gcgcaacaca cgggcaattc cacgttgaca 
aagaatgcgg catgcacatg caaactgaac 
ataacgctga ccacgacaac acaccattca 
tcgccttcag ctgccttccc tataccgctg 
aattaccatt agcacggcga acggtcttat 
gcattgacag ttggggaacg gacgtagaag 
ttgacttcag ctttaacatt catttctaaa 
gcagttattt gccagagcgt tcttttaata 

gtgggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 

gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg ggggtcgacc gatgcccttg 
tgggcgcggg gcatgactat cgtcgccgca 
gtaggacagg tgccggcagc gctctgggtc 
gcgacgatga tcggcctgtc gcttgcggta 
ttcgtcactg gtcccgccac caaacgtttc 
gcggccgacg cgctgggcta cgtcttgctg 
cccattatga ttcttctcgc ttccggcggc 
tccaggcagg tagatgacga ccatcaggga 
agcctaactt cgatcactgg accgctgatc 
acatggaacg ggttggcatg gattgtaggc 
ttgcgtcgcg gtgcatggag ccgggccacc 
ctaacggatt caccactcca agaattggag 
cgcaaaccaa cccttggcag aacatatcca 
ggcgcatctc gggcagcgtt gggtcctggc 
tgaggacccg gctaggctgg cggggttgcc 
gcgagcgaac gtgaagcgac tgctgctgca 
tggtcttcgg tttccgtgtt tcgtaaagtc 
gaagttgccc gcaacagaga gtggaaccaa 
tcaacgccat gagcggcctc atttcttatt 
gtagctcctt ccggtgggcg cggggcatga 
ttatcatgca actcgtagga caggtgccgg 
ctgccaccat acccacgccg aaacaagcgc 
caggatgctg ctggctaccc tgtggaacac 
ttttatcagg ctctgggagg cagaataaat 
gagagcctga gcaaactggc ctcaggcatt 
agtcaataaa ccggtaaacc agcaatagac 
accgacgacc gggtcgaatt tgctttcgaa 
attcaggcgt agcaccaggc gtttaagggc 
cgccctgcca ctcatcgcag tactgttgta 
catcacagac ggcatgatga acctgaatcg 
tataatattt gcccatggtg aaaacggggg 



acgataaaga gtggctctat cgtacaccca 3600 
atgatcacgg tagcggcttt tcagtcaaat 3660 
caacttgtca agatatcgaa ttgacggttg 3720 
cgccactcaa taacaaatac acggatcacg 3780 
acttcgttac cacgaccgtt cctagtacca 3840 
acggtcagat caatatcgcc acccattata 3900 
catttggtct gcggtttatc atgccaacta 3960 
ccggtgagac ttatcctgac cggctggctg 4020 
gtctgccagt cacaccatac ttggtcccac 4080 
aagtgacagt aacgcgatca acaacacaaa 4140 
gtttaacatt tagccaaacc gatgcaccat 4 200 
ctgaactaga aaacgcaaca cacatggaag 4260 
caatctacgg tgccgttcgt ggggtcggtg 4 320 
ccccatatca tatcctcgct aatcaagaca 4380 
aattaatttg atttcaaaag aatcgctccg 4 440 
tgttttatgg ctgagcttta gtcctttgaa 4500 
gggggggggg gggggggggg gggggggggg 4 560 
gggggggggg gggggggggg gggggggggg 4 620 

gggggggggg gggggggggg gggggggggg 4 680 
gggggggggg gggggggggg gggggggggg 47 4 0 

gggggggggg gggggggggg gggggggggg 4800 
gggggggggg gggggggggg gggggggggg 48 60 

gggggggggg gggggggggg gggggggggg 4 920 

gggggggggg gggggggggg gggggggggg 4 980 

gggggggggg gggggggggg gggggggggg 5040 

gggggggggg gggggggggg gggggggggg 5100 

gggggggggg gggggggggg gggggggggg 5160 

gggggggggg gggggggggg gggggggggg 5220 
gggggggggg gggggggggg gggggggggg 5280 
gggggggggg gggggggggg gggggggggg 534 0 
agagccttca acccagtcag ctccttccgg 5400 
cttatgactg tcttctttat catgcaactc 5460 
attttcggcg aggaccgctt tcgctggagc 5520 
ttcggaatct tgcacgccct cgctcaagcc 5580 
ggcgagaagc aggccattat cgccggcatg 5640 
gcgttcgcga cgcgaggctg gatggccttc 5700 
atcgggatgc ccgcgttgca ggccatgctg 57 60 
cagcttcaag gatcgctcgc ggctcttacc 5820 
gtcacggcga tttatgccgc ctcggcgagc 5880 
gccgccctat accttgtctg cctccccgcg 5940 
tcgacctgaa tggaagccgg cggcacctcg 6000 
ccaatcaatt cttgcggaga actgtgaatg 6060 
tcgcgtccgc catctccagc agccgcacgc 6120 
cacgggtgcg catgatcgtg ctcctgtcgt 6180 
ttactggtta gcagaatgaa tcaccgatac 624 0 
aaacgtctgc gacctgagca acaacatgaa 6300 
tggaaacgcg gaagtcccct acgtgctgct 6360 
ccggtgatac cacgatacta tgactgagag 6420 
ctgagttaca acagtccgca ccgctgtccg 6480 
ctatcgtcgc cgcacttatg actgtcttct 6540 
cagcgcccaa cagtcccccg gccacggggc 6600 
cctgcaccat tatgttccgg atctgcatcg 6660 
ctacatctgt attaacgaag cgctaaccgt 6720 
gatcatatcg tcaattatta cctccacggg 6780 
tgagaagcac acggtcacac tgcttccggt 684 0 
ataagcggct atttaacgac cctgccctga 6900 
tttctgccat tcatccgctt attatcactt 6960 
accaataact gccttaaaaa aattacgccc 7020 
attcattaag cattctgccg acatggaagc 7080 
ccagcggcat cagcaccttg tcgccttgcg 7140 
cgaagaagtt gtccatattg gccacgttta 7200 



-7- 



WO 02/051982 



PCT/US01750250 



aatcaaaact ggtgaaactc acccagggat 
accctttagg gaaataggcc aggttttcac 
gtagaaactg ccggaaatcg tcgtggtatt 
gctcatggaa aacggtgtaa caagggtgaa 
tcattgccat acgaattcag aggtttgatg 
aaaataaaaa aaggcgaagc ctattatata 
tttgcgtgaa aaaaaaggca gtgttttcgc 
cgatgtatac agtcacaaaa atcgattttt 
caaaaatcga tgtacacagt acgacttttg 
gcatagagaa aactcactat gaaatgactt 
agcgtattta tactatgatt ataccagaaa 
tgaatgaatt agaaaaacgc aaagtagtgg 
aaatgcaaaa aacagcttta aaaatgtttg 
aacccccaaa aaataatact gtttatcttt 
tatcgtctag cagtaagcat agtcaattca 
cctttttcaa tattaaagct gataagaaac 
ttccttatgt aaaatggaac gattataacg 
ttatgcccta tctgattgat ttaaaggcag 
aaaaactcaa tagcaagtat tcgattattt 
aatacgaaca ttacagcgtg aaagggggac 
acccctcaat aaaagtgaaa gaattgcgag 
attttcccca ttttgaaact agagtattaa 
cctcttttaa tgtaacctat gagaaaaaga 
ttcatattga gaagaaacgc atggcagacg 
atcaagacga taaaaagcaa aaatcaagaa 
aaagtaaata tacaagactt ttatctgaaa 
atacagccac aatggtaggc ttacagaaga 
aattaagagg gctaaatggt gtcaaagacc 
cctattctaa acgtaatgta gcgaaatatc 
cggttaaaag gcaggactta aaccatgagt 
gacgagctag gactagctaa acagaccgtt 
aatcgacaaa aaaacaatag tggttcgatc 
aaaaaaagac taacaaaaat aaaaaaagtt 
aagaattaag cgttaatagt cttttgttag 
aagaatgtga ataaaggccg gataaaactt 
ggccgtaata tccagctgaa cggtctggtt 
ctcaaaatgt tctttacgat gccattggga 
tttctccatt ttagcttcct tagctcctga 
tagtgatctt atttcattat ggtgaaagtt 
ttttcgccaa aagttggccc agggcttccc 
ttctgcgaag tgatcttccg tcacaggtat 
tgccaactta ctgatttagt gtatgatggt 
ctatcagctg tccctcctgt tcagctactg 
ccggacatca gcgctagcgg agtgtatact 
cagtgaagtg cttcatgtgg caggagaaaa 
tgatacagga tatattccgc ttcctcgctc 
cggcgagcgg aaatggctta cgaacggggc 
cttaacaggg aagtgagagg gccgcggcaa 
acaagcatca cgaaatctga cgctcaaatc 
gataccaggc gtttccccct ggcggctccc 
ttaccggtgt cattccgctg ttatggccgc 
tccgggtagg cagttcgctc caagctggac 
cgctgcgcct tatccggtaa ctatcgtctt 
ccactggcag cagccactgg taattgattt 
ttaaggctaa actgaaagga caagttttgg 
gttcaaagag ttggtagctc agagaacctt 
gttttcagag caagagatta cgcgcagacc 
tcagataaaa tatttctaga tttcagtgca 
cagccccata cgatataagt tgtaattctc 
ccccgtaggc gctagggacc tctttagctt 
tttatctcca ttccctttag taacgtgtaa 



tggctgagac gaaaaacata ttctcaataa 7260 
cgtaacacgc cacatcttgc gaatatatgt 7320 
cactccagag cgatgaaaac gtttcagttt 7380 
cactatccca tatcaccagc tcaccgtctt 7440 
actttgacct ttggttttaa atttttgaaa 7500 
tttatcatat atattttaat cttttgttct 7560 
tagttataga aattaaacag tcacaaaaat 7620 
gtgactctat gcatcgatgt atacgagtca 7 680 
tatttgtgta ctgtatatag tataataaaa 7740 
tctctatgct actactaaaa cacgcaaagg 7800 
aacaaaacaa acaaaagcag gtgctgacct 78 60 
agcataacgc tttaattcaa agtgtggcta 7920 
agttagccgt gtcttgtatt gataccgaag 7980 
taaaatctga actttttaaa ttttttgaag 8040 
aagaagcagt taattatatg caaaagcaag 8100 
taggtattga gtatgaaagc attgtgccta 8160 
atgaagtaac aatacgtttt gaccaagcca 8220 
aatttactca atataaaatt tccgaactac 8280 
tgtatcgttg gttatccatg aattataacc 8340 
ggagagttga acaagtggaa tcctaccgca 8400 
aaataactga tacaataaat gaacatcaac 8460 
aaaaagcaat tgaagaaatc aacgctcaca 8520 
aagcagggcg gagcattgac tctattgtct 8580 
ataacagtta caagttggga gataaagact 8640 
atgaagctga cttactaaaa caggcaatgg 8700 
atttcttaat tggcatgaat gatattatgg 8760 
atgtttatcc gctttatgac gaactgaaag 8820 
acttgtctta tgttgccagc aaaaaagaag 8880 
ttaaaaaagc cattgaacaa tatctgccaa 8940 
gaagacttaa aaacgataaa agagttggcg 9000 
caataccatt taaagttttt gccgaccgaa 9060 
gtcctaaacc ttgaagaaca aaagtttatc 9120 
caaccgaccg aaaatcgaca agaaaaaatg 9180 
gaattccgga tgagcattca tcaggcgggc 9240 
gtgcttattt ttctttacgg tctttaaaaa 9300 
ataggtacat tgagcaactg actgaaatgc 9360 
tatatcaacg gtggtatatc cagtgatttt 9420 
aaatctcgat aactcaaaaa atacgcccgg 9480 
ggaacctctt acgtgccgat caacgtctca 954 0 
ggtatcaaca gggacaccag gatttattta 9600 
ttattcggcg caaagtgcgt cgggtgatgc 9660 
gtttttgagg tgctccagtg gcttctgttt 9720 
acggggtggt gcgtaacggc aaaagcaccg 9780 
ggcttactat gttggcactg atgagggtgt 9840 
aaggctgcac cggtgcgtca gcagaatatg 9900 
actgactcgc tacgctcggt cgttcgactg 9960 
ggagatttcc tggaagatgc caggaagata 10020 
agccgttttt ccataggctc cgcccccctg 10080 
agtggtggcg aaacccgaca ggactataaa 1014 0 
tcgtgcgctc tcctgttcct gcctttcggt 10200 
gtttgtctca ttccacgcct gacactcagt 102 60 
tgtatgcacg aaccccccgt tcagtccgac 10320 
gagtccaacc cggaaagaca tgcaaaagca 10380 
agaggagtta gtcttgaagt catgcgccgg 10440 
tgactgcgct cctccaagcc agttacctcg 10500 
cgaaaaaccg ccctgcaagg cggttttttc 10560 
aaaacgatct caagaagatc atcttattaa 10620 
atttatctct tcaaatgtag cacctgaagt 10680 
atgtttgaca gcttatcatc gatacaaatt 107 4 0 
cttggaagct gtcagtagta tatctaataa 10800 
ctttccaaat ttaaaaaagc gactcataga 10860 
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attatttcct cccgttaaat aatagataac tattaaaaat agacaatact tgctcataag 10920 
taatggtact taaattgttt actttggcgt gtttcattgc ttgatgaaac tgatttttag 10980 
taaacagttg acgatattct cgattgaccc attttgaaac aaagtacgta tatagcttcc 11040 
aatatttatc tggaacatct gtggtatggc gggtaagttt tattaagaca ctgtttactt 11100 
ttggtttagg atgaaagcat tccgctggca gcttaagcaa ttgctgaatc gagacttgag 11160 
tgtgcaagag caaccctagt gttcggtgaa tatccaaggt acgcttgtag aatccttctt 11220 
caacaatcag atagatgtca gacgcacggc tttcaaaaac cactttttta ataatttgtg 11280 
tgcttaaatg gtaaggaata ttcccaacaa ttttatacct ctgtttgtta gggaattgaa 11340 
actgtagaat atcttggtga attaaagtga cacgagtatt cagttttaat ttttctgacg 11400 
ataagttgaa tagatgactg tctaattcaa tagacgttac ctgtttactt attttagcca 11460 
gtttcgtcgt taaatgccct ttacctgttc caatttcgta aacggtatcg gtttctttta 11520 
aattcaattg ttttattatt tggttgagta ctttttcact cgttaaaaag ttttgagaat 11580 
attttatatt tttgttcatg taatctctcc tgaagtgatt acatctataa acaaatacag 11640 
aagttaaacg atttgtttgt aattttagtt atctgtttaa aaagtcataa gattagtcac 11700 
tggtaggaat taatctaacg tatttattta tctgcgtaat cactgttttt agtctgtttc 11760 
aaaacagtag atgttttatc tacattacgc atttggaata ccaacatgac gaatccctcc 11820 
ttcttaatta caaattttta gcatctaatt taacttcaat tcctattata caaaatttta 11880 
agataatgca ctatcaacac actcttaagt ttgcttctaa gtcttatttc cataacttct 11940 
tttacgtttc cgccattctt tgctgtttcg atttttatga tatggtgcaa gtcagcacga 12000 
acacgaaccg tcttatctcc cattatatct ttttttgcac tgattggtgt atcatttcgt 12060 
ttttcttttt gtgcgcctaa atttcccaca atcactcact tctttctatt tcttcttatt 12120 
cttattttat catcaacaat cacaaatcac ttgtgatttg tgataagtga tttgtgataa 12180 
gtgatttgtg attaatatat aaaagccctc tttaaagggc ttttatgttt attttgagaa 12240 
agatataaaa tcaatatatc ccttttcccc aatttttaca acggcattgt agggcttttt 12300 
ctctttgttt ttgatttcct tttaccaggg tttcttttcc ctccagtaat tcttttacat 12360 
tcgttttggt tagttttttc tttctaaaat gttcagctaa agtaaactta cattcaggat 12420 
aatttgaaca accataaaac gattttttta atacaatatt gttgccacac ttaggacatt 12480 
ttcctacaat acttttttct gcttcttttt ctttctgttc ctggtaatca gaaaaattta 12540 
gtttttctat atcgttaggt acagcttcca gtaaatgaac aatgaatttt ttgatattcg 12600 
taataaagtt ctcttgattg ccttctcttt taccgatttt ttttaaatac gtttcccatt 12660 
tagccgtcat ttcagcactc gttaaaaggt gctgactttc aactgcctgg cacaataatt 12720 
ttcctttttc agttacaac 12739 



<210> 9 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide P59 
<400> 9 

gaccgacatt aaattcttga cagggagaga taggtttgat agaatataat agttgtc 57 

<210> 10 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide P59 complement 
<400> 10 

tcgagacaac tattatattc tatcaaacct atctctccct gtcaagaatt taatgtcg 58 

<210> 11 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Lac operator sequence 
<400> 11 

ttgttatccg ctcacaa 17 

<210> 12 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Xyl/Lac-T5 
<400> 12 

gaccgtcata aaaaatttat ttgctttgtg agcggataac aattataata gattcaagtt 60 
agtttgttta ttaaattaac caactaaaat gtag 94 

<210> 13 
<211> 95 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Xyl/Lac-T5 complement 
<400> 13 

aattctacat tttagttggt taatttaata aacaaactaa cttgaatcta ttataattgt 60 
tatccgctca caaagcaaat aaatttttta tgacg 95 

<210> 14 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Xyl/Lac-P59 
<400> 14 

gaccgacatt aaattcttga cattgtgagc ggataacaat agaatataat agttgtc 57 

<210> 15 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Xyl/Lac P59 complement 
<400> 15 

tcgagacaac tattatattc tattgttatc cgctcacaat gtcaagaatt taatgtcg 58 

<210> 16 
<211> 6852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pXyl-T5 pl5a 
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<400> 16 

tggcattcta cgactataac ttaaatttat 
tagaataatg ttgctcatat cgtttgccaa 
cttggtaata ttaaccgaac ttcccctgtc 
aaataatttt atagtaaatg aaaatgagaa 
caatatttca cgaacacaaa tatcgaagaa 
cattctgaac aacttaaaac acaagagttt 
taaaagtggt ggacgaaagc ctattttact 
ttctatggat ttaacatatg attccgttga 
attaaagcaa gattcctacg aattaaatga 
aaaatctaat ataaacgtct cagaaaaata 
tatatccata cacggtatcg ttgacgatga 
aaatgagaaa cgcacattta ccgatgaatt 
agaaaatgaa gcaaatttat cagcgctata 
aaataatttg attactttaa gtattcacaa 
aaaactttat cgtggctcaa atggagaggc 
atctataaat aacaatgaca acaaatatta 
tttaatacag aaaataaata ataggttggg 
atattacaac gaaggaaatt caattgttgc 
gacagttctg attcataatt tgaatacaca 
tcctttaatt aatgaattac caaatatttt 
ttctcaaggc agtccaattc aattacattt 
gggtggcact ttagcaataa tgcaaaaaac 
taaataatta cagcagtctg agttataaaa 
tttgctttca ggaaaatttt tctgtataat 
ccaactaaaa tgtagaattc gagctcggta 
caagcttggg cttttcagcc tgatacagat 
acagaatttg cctggcggca gtagcgcggt 
agtgaaacgc cgtagcgccg atggtagtgt 
ccaggcatca aataaaacga aaggctcagt 
gtttgtcggt gaacgctctc ctgagtagga 
cgaagcaacg gcccggaggg tggcgggcag 
ttaagcagaa ggccatcctg acggatggcc 
atttttctaa atacattcaa atatgtatcc 
tcaaacatga gaattaaatc aatctaaagt 
caatgcttaa tcagtgaggc acctatctca 
gcctgactcc ccgtcgtgta gataactacg 
gctgcaatga taccgcgaga cccacgctca 
ccagccggaa gggccgagcg cagaagtggt 
attaattgtt gccgggaagc tagagtaagt 
gttgccattg ctacaggcat cgtggtgtca 
tccggttccc aacgatcaag gcgagttaca 
agctccttcg gtcctccgat cgttgtcaga 
gttatggcag cactgcataa ttctcttact 
actggtgagt actcaaccaa gtcattctga 
tgcccggcgt caacacggga taataccgcg 
attggaaaac gctcttcggg gcgaaaactc 
tcgatgtaac ccactcgtgc acccaactga 
tctgggtgag caaaaacagg aaggcaaaat 
aaatgttgaa tactcatact cttccttttt 
tgtctcatga gcggatacat atttgaatgt 
cgcacatttc cccgaaaagt gccacctggc 
tttggtctgc gcgtaatctc ttgctctgaa 
tcgaaggttc tctgagctac caactctttg 
tcaccaaaac ttgtcctttc agtttagcct 
tctaaatcaa ttaccagtgg ctgctgccag 
ctcaagacga tagttaccgg ataaggcgca 
acagtccagc ttggagcgaa ctgcctaccc 
aacgcggcca taacagcgga atgacaccgg 
cgagggagcc gccaggggaa acgcctggta 
ctgatttgag cgtcagattt cgtgatgctt 



attttttact ttataatata taattgatta 60 
catctagtac tcaaattaca ctatgttaca 120 
caaattagat aagaggtaat aataaatgga 180 
gcgtgtatta aaacaaattt tcaataacag 240 
tttagaactt aataaagcta ctatttctaa 300 
agttaatgaa gtaggagaag gtaatagtac 360 
cgaaattaac caaaaatatg gctactatat 420 
attaatgtac aactactttg atgctactat 480 
taaaaatgta agcagtatat tacaaatttt 540 
tgatacgtta tatgggttac ttggtatatc 600 
gcaaaacata atcaatcttc cttttcataa 660 
aaagtcattc acaaatgttc ctgtcgttat 720 
tgaaaaaagt ttatatatta attcaaacat 780 
gggtataggc gctggcatca taataaataa 840 
tggagagata ggtaagacat tggttttgga 900 
taaaatcgaa gatatatgct cccaagacgc 960 
cgtcacattg acgtttacag aactaatcca 1020 
tcatgaaatt aaacaattta ttaataaaat 1080 
atttaaccca gacgctattt atattaactg 1140 
aaatgaaatt aaagagcaat tctcctgttt 1200 
aactactaat gtaaaacaag ctactttatt 1260 
attaaatata aataacattc aaatgaatat 1320 
tagatatctc ggaccgtcat aaaaaattta 1380 
agattcaagt tagtttgttt attaaattaa 1440 
cccggggatc ctctagagtc gacctgcagc 1500 
taaatcagaa cgcagaagcg gtctgataaa 1560 
ggtcccacct gaccccatgc cgaactcaga 1620 
ggggtctccc catgcgagag tagggaactg 1680 
cgaaagactg ggcctttcgt tttatctgtt 1740 
caaatccgcc gggagcggat ttgaacgttg 1800 
gacgcccgcc ataaactgcc aggcatcaaa 1860 
tttttgcgtt tctacaaact ctttttgttt 1920 
gctgatcccc atcctatcga tgataagctg 1980 
atatatgagt aaacttggtc tgacagttac 2040 
gcgatctgtc tatttcgttc atccatagtt 2100 
atacgggagg gcttaccatc tggccccagt 2160 
ccggctccag atttatcagc aataaaccag 2220 
cctgcaactt tatccgcctc catccagtct 2280 
agttcgccag ttaatagttt gcgcaacgtt 2340 
cgctcgtcgt ttggtatggc ttcattcagc 24 00 
tgatccccca tgttgtgcaa aaaagcggtt 2460 
agtaagttgg ccgcagtgtt atcactcatg 2520 
gtcatgccat ccgtaagatg cttttctgtg 2580 
gaatagtgta tgcggcgacc gagttgctct 2640 
ccacatagca gaactttaaa agtgctcatc 2700 
tcaaggatct taccgctgtt gagatccagt 2760 
tcttcagcat cttttacttt caccagcgtt 2820 
gccgcaaaaa agggaataag ggcgacacgg 2880 
caatattatt gaagcattta tcagggttat 2940 
atttagaaaa ataaacaaat aggggttccg 3000 
ggccgcataa gatgatcttc ttgagatcgt 3060 
aacgaaaaaa ccgccttgca gggcggtttt 3120 
aaccgaggta actggcttgg aggagcgcag 3180 
taaccggcgc atgacttcaa gactaactcc 3240 
tggtgctttt gcatgtcttt ccgggttgga 3300 
gcggtcggac tgaacggggg gttcgtgcat 3360 
ggaactgagt gtcaggcgtg gaatgagaca 3420 
taaaccgaaa ggcaggaaca ggagagcgca 3480 
tctttatagt cctgtcgggt ttcgccacca 3540 
gtcagggggg cggagcctat ggaaaaacgg 3600 
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ctttgccgcg gccctctcac ttccctgtta 
ccgccccgtt cgtaagccat ttccgctcgc 
gtgagcgagg aagcggaata tatcctgtat 
ttttttctcc tgccacatga agcacttcac 
ccagtgcggc cgcttctttc ctgcgttatc 
ctttgagtga gctgataccg ctcgccgcag 
cgaggaagcg gaagagcgcc tgatgcggta 
acaccgcata agatcccctc gacctgcagg 
aaactatatg atttacccct aaatctttaa 
ttaaaattta aatatttctt gtgataaagt 
atgtggttat cgattatagg tatgtggttt 
gttattaaaa ataaaaatta gaccacgcat 
aagaaccctt aactaaactt gcagacgaat 
attcgacaag ttttgggatt gttaagggtt 
ataaagaagc gaaaaaggag aagtcggttc 
aatataaaaa ccttcttcaa ctaacggggc 
aaagtacagt cggcattatc tcatattata 
cctgaataga gttcataaac aatcctgcat 
gtaaagatag cggtaaatat attgaattac 
tgggtagaag gtaattacta ttattattga 
atggaataat agaaagagaa aaagcatttt 
ccgaaccatt atatttctct acatcagaaa 
tctttacagg agtccaaata ccagagaatg 
gtggctctaa cttatcccaa taacctaact 
ctgtatttga gtttatcacc cttgtcacta 
cttcttgttt tatgtttcgg tataaaacac 
tcgtttgttg gttcaaataa tgattaaata 
ttttattaaa gttcatttga tatgcctcct 
ttacttgtct gctttcttca ttagaatcaa 
ataaatatat attttaaaaa tatcccactt 
tgctttagtt gaagaataaa gaccacatta 
gatttgagcg tagcgaaaaa tccttttctt 
cggcgaggct agttaccctt aagttattgg 
ctttttcgta cctattaatg tatcgtttta 
ggaaaaagca acttttttta ttgtcatagt 
taacatggaa aagtatactg agaaaaaaca 
acgtcatatt ggagagaatc aaatggattt 
tgtagctgat aaaactttag aaaaacagaa 
attttgtcct gtctgtgctt ggagaaaagc 
gatgcaatat attaagcagc aagagaaaaa 
taatgtaatg agtgatgaat tagaaaatga 
acttataaag agaaaaaaag taggtagtgt 
tacatataat aaaaaaagag atgattataa 
taaatcgtat ttcacagata aaagatatta 
gcgtgatgta acgggcattt cagaaataac 
taataataaa gaattatatg aaatggctaa 
aaataaatca aaaagtcttt gatgcatttt 
tttattcagg attatttaaa gaggctaaaa 
taaaagaaat tgatccaacc gtaatatatc 
gagtatttag ctagtgaact ttatgactta 
aaaatgatag acgaaatcga ggaagaacaa 
caagtttagt taagggttct tctcaacgca 
ctaattttta tttttaataa ccttgatagc 
acctataatc gataaccaca taacagtcat 
caagaaatat tt 



agtatcttcc tggcatcttc caggaaatct 3660 
cgcagtcgaa cgaccgagcg tagcgagtca 3720 
cacatattct gctgacgcac cggtgcagcc 3780 
tgacaccctc atcagtgcca acatagtaag 3840 
ccctgattct gtggataacc gtattaccgc 3900 
ccgaacgacc gagcgcagcg agtcagtgag 3960 
ttttctcctt acgcatctgt gcggtatttc 4020 
catgcaagct tctgtaggtt tttaggcata 4080 
aatgcccctt aaaattcaaa ataaaggcat 4140 
ttgttaaaaa ggagtggttt tatgactgtt 4200 
tgtattggaa tggcattttt tgctatcaag 4260 
ttatgccgag aaaatttatt gtgcgttgag 4320 
gtcggcatag cgtgagctat taagccgacc 4 380 
ccgaggctca acgtcaataa agcaattgga 44 40 
agaaaaagaa ggatatggat ctggagctgt 4500 
aggttagtga cattagaaaa ccgactgtaa 4560 
aaagccagtc attaggccta tctgacaatt 4 620 
gataaccatc acaaacagaa tgatgtacct 4680 
ctttattaat gaattttcct gctgtaataa 4740 
tatttaagtt aaacccagta aatgaagtcc 4800 
caggtatagg tgttttggga aacaatttcc 4860 
ggtataaatc ataaaactct ttgaagtcat 4920 
ttttagatac accatcaaaa attgtataaa 4980 
ctccgtcgct attgtaacca gttctaaaag 5040 
agaaaataaa tgcagggtaa aatttatatc 5100 
taatatcaat ttctgtggtt atactaaaag 5160 
tctcttttct cttccaattg tctaaatcaa 5220 
aaatttttat ctaaagtgaa tttaggaggc 5280 
tcctttttta aaagtcaata ttactgtaac 5340 
tatccaattt tcgtttgttg aactaatggg 5400 
aaaaatgtgg tcttttgtgt ttttttaaag 5460 
tcttatcttg ataataaggg taactattgc 5520 
tatgactggt tttaagcgca aaaaaagttg 5580 
aatgactagt aaaaaacata catagaaagg 5640 
ttgtgaaaac taagttgttt ttatgtgtta 5700 
aagaaatcaa gtatttcaga aatttattaa 5760 
agttgaagat tgcaatacat ttctgtcttt 5820 
attatataaa gctaattctt gtaaaaatcg 5880 
tagaaaagat gcattgggtt tatctttgat 5940 
ggagtttatc tttttaactt tgactacacc 6000 
aataaaacgt tataataatt cttttagaaa 6060 
tataaaggga tatgttcgta agttagagat 6120 
tcctcatttt catgtgttaa ttgcagtaaa 6180 
tattagccaa caagaatggt tagatttatg 6240 
acaagttcaa gttcaaaaaa taagacaaaa 6300 
gtattctggt aaagatagtg attatttaat 6360 
ataaatcact taaaggtaaa caggtattag 6420 
agaaattaaa aaatggggat ttagattact 6480 
tatcaaattt tttatatttg gaaacaaaaa 6540 
acagaacaag aaaaaagaga aattaatcac 6600 
taacaaaata taagtgctaa cagtcgtctg 6660 
caataaattt tctcggcata aatgcgtggt 6720 
aaaaaatgcc attccaatac aaaaccacat 6780 
aaaaccactc ctttttaaca aactttatca 6840 

6852 



<210> 17 
<211> 9437 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Vector pEPEF3 
<400> 17 

caacccagtc agctccttcc ggtgggcgcg 
tgtcttcttt atcatgcaac tcgtaggaca 
cgaggaccgc tttcgctgga gcgcgacgat 
cttgcacgcc ctcgctcaag ccttcgtcac 
gcaggccatt atcgccggca tggcggccga 
gacgcgaggc tggatggcct tccccattat 
gcccgcgttg caggccatgc tgtccaggca 
aggatcgctc gcggctctta ccagcctaac 
gatttatgcc gcctcggcga gcacatggaa 
ataccttgtc tgcctccccg cgttgcgtcg 
aatggaagcc ggcggcacct cgctaacgga 
ttcttgcgga gaactgtgaa tgcgcaaacc 
gccatctcca gcagccgcac gcggcgcatc 
cgcatgatcg tgctcctgtc gttgaggacc 
tagcagaatg aatcaccgat acgcgagcga 
gcgacctgag caacaacatg aatggtcttc 
cggaagtccc ctacgtgctg ctgaagttgc 
accacgatac tatgactgag agtcaacgcc 
caacagtccg caccgctgtc cggtagctcc 
gccgcactta tgactgtctt ctttatcatg 
aacagtcccc cggccacggg gcctgccacc 
attatgttcc ggatctgcat cgcaggatgc 
gtattaacga agcgctaacc gtttttatca 
cgtcaattat tacctccacg gggagagcct 
acacggtcac actgcttccg gtagtcaata 
ctatttaacg accctgccct gaaccgacga 
attcatccgc ttattatcac ttattcaggc 
ctgccttaaa aaaattacgc cccgccctgc 
agcattctgc cgacatggaa gccatcacag 
atcagcacct tgtcgccttg cgtataatat 
ttgtccatat tggccacgtt taaatcaaaa 
acgaaaaaca tattctcaat aaacccttta 
gccacatctt gcgaatatat gtgtagaaac 
agcgatgaaa acgtttcagt ttgctcatgg 
catatcacca gctcaccgtc tttcattgcc 
ctttggtttt aaatttttga aaaaaataaa 
atatatttta atcttttgtt cttttgcgtg 
gaaattaaac agtcacaaaa atcgatgtat 
atgcatcgat gtatacgagt cacaaaaatc 
tactgtatat agtataataa aagcatagag 
ctactactaa aacacgcaaa ggagcgtatt 
aaacaaaagc aggtgctgac cttgaatgaa 
gctttaattc aaagtgtggc taaaatgcaa 
gtgtcttgta ttgataccga agaaccccca 
gaacttttta aattttttga agtatcgtct 
gttaattata tgcaaaagca agcctttttc 
gagtatgaaa gcattgtgcc tattccttat 
acaatacgtt ttgaccaagc cattatgccc 
caatataaaa tttccgaact acaaaaactc 
tggttatcca tgaattataa ccaatacgaa 
gaacaagtgg aatcctaccg caacccctca 
gatacaataa atgaacatca acattttccc 
attgaagaaa tcaacgctca cacctctttt 
cggagcattg actctattgt ctttcatatt 
tacaagttgg gagataaaga ctatcaagac 



gggcatgact atcgtcgccg cacttatgac 60 
ggtgccggca gcgctctggg tcattttcgg 120 
gatcggcctg tcgcttgcgg tattcggaat 180 
tggtcccgcc accaaacgtt tcggcgagaa 240 
cgcgctgggc tacgtcttgc tggcgttcgc 300 
gattcttctc gcttccggcg gcatcgggat 360 
ggtagatgac gaccatcagg gacagcttca 420 
ttcgatcact ggaccgctga tcgtcacggc 4 80 
cgggttggca tggattgtag gcgccgccct 540 
cggtgcatgg agccgggcca cctcgacctg 600 
ttcaccactc caagaattgg agccaatcaa 660 
aacccttggc agaacatatc catcgcgtcc 720 
tcgggcagcg ttgggtcctg gccacgggtg 780 
cggctaggct ggcggggttg ccttactggt 840 
acgtgaagcg actgctgctg caaaacgtct 900 
ggtttccgtg tttcgtaaag tctggaaacg 960 
ccgcaacaga gagtggaacc aaccggtgat 1020 
atgagcggcc tcatttctta ttctgagtta 1080 
ttccggtggg cgcggggcat gactatcgtc 1140 
caactcgtag gacaggtgcc ggcagcgccc 1200 
atacccacgc cgaaacaagc gccctgcacc 1260 
tgctggctac cctgtggaac acctacatct 1320 
ggctctggga ggcagaataa atgatcatat 1380 
gagcaaactg gcctcaggca tttgagaagc 1440 
aaccggtaaa ccagcaatag acataagcgg 1500 
ccgggtcgaa tttgctttcg aatttctgcc 1560 
gtagcaccag gcgtttaagg gcaccaataa 1620 
cactcatcgc agtactgttg taattcatta 1680 
acggcatgat gaacctgaat cgccagcggc 1740 
ttgcccatgg tgaaaacggg ggcgaagaag 1800 
ctggtgaaac tcacccaggg attggctgag 1860 
gggaaatagg ccaggttttc accgtaacac 1920 
tgccggaaat cgtcgtggta ttcactccag 1980 
aaaacggtgt aacaagggtg aacactatcc 2040 
atacgaattc agaggtttga tgactttgac 2100 
aaaaggcgaa gcctattata tatttatcat 2160 
aaaaaaaagg cagtgttttc gctagttata 2220 
acagtcacaa aaatcgattt ttgtgactct 2280 
gatgtacaca gtacgacttt tgtatttgtg 2340 
aaaactcact atgaaatgac tttctctatg 2400 
tatactatga ttataccaga aaaacaaaac 2460 
ttagaaaaac gcaaagtagt ggagcataac 2520 
aaaacagctt taaaaatgtt tgagttagcc 2580 
aaaaataata ctgtttatct tttaaaatct 2640 
agcagtaagc atagtcaatt caaagaagca 2700 
aatattaaag ctgataagaa actaggtatt 2760 
gtaaaatgga acgattataa cgatgaagta 2820 
tatctgattg atttaaaggc agaatttact 2880 
aatagcaagt attcgattat tttgtatcgt 2940 
cattacagcg tgaaaggggg acggagagtt 3000 
ataaaagtga aagaattgcg agaaataact 3060 
cattttgaaa ctagagtatt aaaaaaagca 3120 
aatgtaacct atgagaaaaa gaaagcaggg 3180 
gagaagaaac gcatggcaga cgataacagt 324 0 
gataaaaagc aaaaatcaag aaatgaagct 3300 
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gacttactaa aacaggcaat ggaaagtaaa 
attggcatga atgatattat ggatacagcc 
ccgctttatg acgaactgaa agaattaaga 
tatgttgcca gcaaaaaaga agcctattct 
gccattgaac aatatctgcc aacggttaaa 
aaaaacgata aaagagttgg cggacgagct 
tttaaagttt ttgccgaccg aaaatcgaca 
ccttgaagaa caaaagttta tcaaaaaaag 
cgaaaatcga caagaaaaaa tgaagaatta 
gatgagcatt catcaggcgg gcaagaatgt 
ttttctttac ggtctttaaa aaggccgtaa 
attgagcaac tgactgaaat gcctcaaaat 
cggtggtata tccagtgatt tttttctcca 
ataactcaaa aaatacgccc ggtagtgatc 
ttacgtgccg atcaacgtct cattttcgcc 
cagggacacc aggatttatt tattctgcga 
cgcaaagtgc gtcgggtgat gctgccaact 
ggtgctccag tggcttctgt ttctatcagc 
gtgcgtaacg gcaaaagcac cgccggacat 
atgttggcac. tgatgagggt gtcagtgaag 
accggtgcgt cagcagaata tgtgatacag 
gctacgctcg gtcgttcgac tgcggcgagc 
cctggaagat gccaggaaga tacttaacag 
ttccataggc tccgcccccc tgacaagcat 
cgaaacccga caggactata aagataccag 
tctcctgttc ctgcctttcg gtttaccggt 
cattccacgc ctgacactca gttccgggta 
cgaacccccc gttcagtccg accgctgcgc 
cccggaaaga catgcaaaag caccactggc 
tagtcttgaa gtcatgcgcc ggttaaggct 
ctcctccaag ccagttacct cggttcaaag 
cgccctgcaa ggcggttttt tcgttttcag 
ctcaagaaga tcatcttatt aatcagataa 
cttcaaatgt agcacctgaa gtcagcccca 
cagcttatca tcgatacaaa ttccccgtag 
ctgtcagtag tatatctaat aatttatctc 
atttaaaaaa gcgactcata gaattatttc 
atagacaata cttgctcata agtaatggta 
gcttgatgaa actgattttt agtaaacagt 
acaaagtacg tatatagctt ccaatattta 
tttattaaga cactgtttac ttttggttta 
aattgctgaa tcgagacttg agtgtgcaag 
gtacgcttgt agaatccttc ttcaacaatc 
accacttttt taataatttg tgtgcttaaa 
ctctgtttgt tagggaattg aaactgtaga 
ttcagtttta atttttctga cgataagttg 
acctgtttac ttattttagc cagtttcgtc 
taaacggtat cggtttcttt taaattcaat 
ctcgttaaaa agttttgaga atattttata 
ttacatctat aaacaaatac agaagttaaa 
aaaaagtcat aagattagtc actggtagga 
atcactgttt ttagtctgtt tcaaaacagt 
taccaacatg acgaatccct ccttcttaat 
attcctatta tacaaaattt taagataatg 
aagtcttatt tccataactt cttttacgtt 
gatatggtgc aagtcagcac gaacacgaac 
actgattggt gtatcatttc gtttttcttt 
cttctttcta tttcttctta ttcttatttt 
tgtgataagt gatttgtgat aagtgatttg 
gcttttatgt ttattttgag aaagatataa 
caacggcatt gtagggcttt ttctctttgt 



tatacaagac ttttatctga aaatttctta 3360 
acaatggtag gcttacagaa gaatgtttat 3420 
gggctaaatg gtgtcaaaga ccacttgtct 3480 
aaacgtaatg tagcgaaata tcttaaaaaa 3540 
aggcaggact taaaccatga gtgaagactt 3600 
aggactagct aaacagaccg ttcaatacca 3660 
aaaaaacaat agtggttcga tcgtcctaaa 3720 
actaacaaaa ataaaaaaag ttcaaccgac 3780 
agcgttaata gtcttttgtt aggaattccg 384 0 
gaataaaggc cggataaaac ttgtgcttat 3900 
tatccagctg aacggtctgg ttataggtac 3960 
gttctttacg atgccattgg gatatatcaa 4020 
ttttagcttc cttagctcct gaaaatctcg 4080 
ttatttcatt atggtgaaag ttggaacctc 4140 
aaaagttggc ccagggcttc ccggtatcaa 4200 
agtgatcttc cgtcacaggt atttattcgg 4260 
tactgattta gtgtatgatg gtgtttttga 4320 
tgtccctcct gttcagctac tgacggggtg 4380 
cagcgctagc ggagtgtata ctggcttact 4440 
tgcttcatgt ggcaggagaa aaaaggctgc 4500 
gatatattcc gcttcctcgc tcactgactc 4560 
ggaaatggct tacgaacggg gcggagattt 4 620 
ggaagtgaga gggccgcggc aaagccgttt 4680 
cacgaaatct gacgctcaaa tcagtggtgg 4740 
gcgtttcccc ctggcggctc cctcgtgcgc 4800 
gtcattccgc tgttatggcc gcgtttgtct 4860 
ggcagttcgc tccaagctgg actgtatgca 4 920 
cttatccggt aactatcgtc ttgagtccaa 4 980 
agcagccact ggtaattgat ttagaggagt 504 0 
aaactgaaag gacaagtttt ggtgactgcg 5100 
agttggtagc tcagagaacc ttcgaaaaac 5160 
agcaagagat tacgcgcaga ccaaaacgat 5220 
aatatttcta gatttcagtg caatttatct 5280 
tacgatataa gttgtaattc tcatgtttga 5340 
gcgctaggga cctctttagc ttcttggaag 5400 
cattcccttt agtaacgtgt aactttccaa 5460 
ctcccgttaa ataatagata actattaaaa 5520 
cttaaattgt ttactttggc gtgtttcatt 5580 
tgacgatatt ctcgattgac ccattttgaa 5640 
tctggaacat ctgtggtatg gcgggtaagt 5700 
ggatgaaagc attccgctgg cagcttaagc 57 60 
agcaacccta gtgttcggtg aatatccaag 5820 
agatagatgt cagacgcacg gctttcaaaa 5880 
tggtaaggaa tattcccaac aattttatac 594 0 
atatcttggt gaattaaagt gacacgagta 6000 
aatagatgac tgtctaattc aatagacgtt 6060 
gttaaatgcc ctttacctgt tccaatttcg 6120 
tgttttatta tttggttgag tactttttca 6180 
tttttgttca tgtaatctct cctgaagtga 624 0 
cgatttgttt gtaattttag ttatctgttt 6300 
attaatctaa cgtatttatt tatctgcgta 6360 
agatgtttta tctacattac gcatttggaa 6420 
tacaaatttt tagcatctaa tttaacttca 6480 
cactatcaac acactcttaa gtttgcttct 654 0 
tccgccattc tttgctgttt cgatttttat 6600 
cgtcttatct cccattatat ctttttttgc 6660 
ttgtgcgcct aaatttccca caatcactca 6720 
atcatcaaca atcacaaatc acttgtgatt 6780 
tgattaatat ataaaagccc tctttaaagg 684 0 
aatcaatata tcccttttcc ccaattttta 6900 
ttttgatttc cttttaccag ggtttctttt 6960 
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ccctccagta attcttttac attcgttttg 
aaagtaaact tacattcagg ataatttgaa 
ttgttgccac acttaggaca ttttcctaca 
tcctggtaat cagaaaaatt tagtttttct 
acaatgaatt ttttgatatt cgtaataaag 
ttttttaaat acgtttccca tttagccgtc 
tcaactgcct ggcacaataa ttttcctttt 
cgataaccac ataacagtca taaaaccact 
ttttggcatt ctacgactat aacttaaatt 
ttatagaata atgttgctca tatcgtttgc 
acacttggta atattaaccg aacttcccct 
ggaaaataat tttatagtaa atgaaaatga 
cagcaatatt tcacgaacac aaatatcgaa 
taacattctg aacaacttaa aacacaagag 
tactaaaagt ggtggacgaa agcctatttt 
tatttctatg gatttaacat atgattccgt 
tatattaaag caagattcct acgaattaaa 
tttaaaatct aatataaacg tctcagaaaa 
atctatatcc atacacggta tcgttgacga 
taaaaatgag aaacgcacat ttaccgatga 
tatagaaaat gaagcaaatt tatcagcgct 
cataaataat ttgattactt taagtattca 
taaaaaactt tatcgtggct caaatggaga 
ggaatctata aataacaatg acaacaaata 
cgctttaata cagaaaataa ataataggtt 
ccaatattac aacgaaggaa attcaattgt 
aatgacagtt ctgattcata atttgaatac 
ctgtccttta attaatgaat taccaaatat 
tttttctcaa ggcagtccaa ttcaattaca 
attgggtggc actttagcaa taatgcaaaa 
tattaaataa ttacagcagt ctgagttata 
tattcttgac atgttagtga gggggctggt 
gtttgtttat taaattaacc aactggatcc 
aagcttgggc ttttcagcct gatacagatt 
cagaatttgc ctggcggcag tagcgcggtg 
gtgaaacgcc gtagcgccga tggtagtgtg 
caggcatcaa ataaaacgaa aggctcagtc 
tttgtcggtg aacgctctcc tgagtaggac 
gaagcaacgg cccggagggt ggcgggcagg 
taagcagaag gccatcctga cggatggcct 
tttttctaaa tacattcaaa tatgtatccg 
atgcccttga gagcctt 



gttagttttt tctttctaaa atgttcagct 7020 
caaccataaa acgatttttt taatacaata 7080 
atactttttt ctgcttcttt ttctttctgt 7140 
atatcgttag gtacagcttc cagtaaatga 7200 
ttctcttgat tgccttctct tttaccgatt 7260 
atttcagcac tcgttaaaag gtgctgactt 7320 
tcagttacaa caagcttgcc tgcaggtcga 7380 
cctttttaac aaactttatc acaagaaata 7440 
tatatttttt actttataat atataattga 7500 
caacatctag tactcaaatt acactatgtt 7560 
gtccaaatta gataagaggt aataataaat 7620 
gaagcgtgta ttaaaacaaa ttttcaataa 7680 
gaatttagaa cttaataaag ctactatttc 7740 
tttagttaat gaagtaggag aaggtaatag 7800 
actcgaaatt aaccaaaaat atggctacta 7860 
tgaattaatg tacaactact ttgatgctac 7920 
tgataaaaat gtaagcagta tattacaaat 7980 
atatgatacg ttatatgggt tacttggtat 8040 
tgagcaaaac ataatcaatc ttccttttca 8100 
attaaagtca ttcacaaatg ttcctgtcgt 8160 
atatgaaaaa agtttatata ttaattcaaa 8220 
caagggtata ggcgctggca tcataataaa 8280 
ggctggagag ataggtaaga cattggtttt 8340 
ttataaaatc gaagatatat gctcccaaga 8400 
gggcgtcaca ttgacgttta cagaactaat 84 60 
tgctcatgaa attaaacaat ttattaataa 8520 
acaatttaac ccagacgcta tttatattaa 8580 
tttaaatgaa attaaagagc aattctcctg 8640 
tttaactact aatgtaaaac aagctacttt 8700 
aacattaaat ataaataaca ttcaaatgaa 8760 
aaatagatat ctcggaccgc tttggcagtt 8820 
ataatcacat agtactgttc tcgagagtta 8880 
gaattcccgg ggatccgtcg acctgcagcc 8940 
aaatcagaac gcagaagcgg tctgataaaa 9000 
gtcccacctg accccatgcc gaactcagaa 9060 
gggtctcccc atgcgagagt agggaactgc 9120 
gaaagactgg gcctttcgtt ttatctgttg 9180 
aaatccgccg ggagcggatt tgaacgttgc 9240 
acgcccgcca taaactgcca ggcatcaaat 9300 
ttttgcgttt ctacaaactc tttttgttta 9360 
ctgatcccca tcctatcgga tcctcgaccg 9420 

9437 



<210> 18 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide PL10 
<400> 18 

gaccgtttcg tgaacttttt tgttgacaaa gataaaaaca catgatataa ttaaatcac 59 

<210> 19 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide PL10 complement 
<400> 19 

tcgagtgatt taattatatc atgtgttttt atctttgtca acaaaaaagt tcacgaaacg 60 



<210> 20 
<211> 64 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide P59* 
<400> 20 

gaccgaaaaa tgacagttta ttcttgacag ggagagatag gtttgatata atataatagt 60 
tgtc 64 

<210> 21 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide P59* complement 
<400> 21 

tcgagacaac tattatatta tatcaaacct atctctccct gtcaagaata aactgtcatt 60 
tttcg 65 

<210> 22 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<22,0> 

<223> Oligonucleotide Xyl-T5-DD 
<400> 22 

gaccgtcata aaaaatttat tttacatcag gaaaattttt ctgtatatta gattcaagtt 60 
agtttgttta ttaaattaac caactaaaat gtag 94 

<210> 23 
<211> 95 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Xyl-T5-DD complement 
<400> 23 

aattctacat tttagttggt taatttaata aacaaactaa cttgaatcta atatacagaa 60 
aaattttcct gatgtaaaat aaatttttta tgacg 95 

<210> 24 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 



-16- 



WO 02/051982 



PCT/US01750250 



<223> pXyl-T5F primer 
<400> 24 

cagcagtctg agttataaaa tag 

<210> 25 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> LexL primer 
<4O0> 25 

tgttttatca gaccgctt 

<210> 26 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xyl-T5 fusion sequence 
<400> 26 

tcataaaaaa tttatttgct ttcaggaaaa 
gtttattaaa ttaaccaact 



18 



tttttctgta taatagattc aagttagttt 60 

80 



<210> 27 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xyl-T5-DD fusion sequence 



<400> 27 

tcataaaaaa tttattttac atcaggaaaa tttttctgta tattagattc aagttagttt 60 
gtttattaaa ttaaccaact 80 



<210> 28 
<211> 95 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CP25 fusion sequence 



<400> 28 

ctttggcagt ttattcttga catgttagtg agggggctgg tataatcaca tagtactgtt 60 
ctcgagagtt agtttgttta ttaaattaac caact 95 



<210> 29 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P32 fusion sequence 



<400> 29 
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gtgagcttgg actagaaaaa aacttcacaa aatgctatac taggtaggta actcgagagt 60 
tagtttgttt attaaattaa ccaact 86 

<210> 30 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P59 fusion sequence 
<400> 30 

acattaaatt cttgacaggg agagataggt ttgatagaat ataatagttg tctcgagagt 60 
tagtttgttt attaaattaa ccaact 86 

<210> 31 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P59* fusion sequence 
<400> 31 

aaaaatgaca gtttattctt gacagggaga gataggtttg atataatata atagttgtct 60 
cgagagttag tttgtttatt aaattaacca act 93 

<210> 32 
<211> 150 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P1P2 fusion sequence 
<400> 32 

gtctagttat tcttgacaaa aaataatatt ttgatataat taaatagttg tcgtttgata 60 
acacagttta ttcttgacaa aaaaatataa aagtgtataa tagaaaagta ctgttctcga 120 
gagttagttt gtttattaaa ttaaccaact 150 

<210> 33 
<211> 242 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P1P2* fusion sequence 
<400> 33 

aaacagccga ataaggctgt ttttcttttt tttatgtttt agaataagtg gtctagttta 60 
ttcttgacaa aaaataatat tttgatataa ttaaatagtt gtcgtttgag acgactgact 120 
ttcttattat tcatctaaaa tattattttg aaaagataac acagttattc ttgacaaaaa 180 
aatataaaag tgtataatag aaaagtactc gagagttagt ttgtttatta aattaaccaa 240 
ct 242 

<210> 34 
<211> 88 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> PL fusion sequence 
<400> 34 

tttcgtgaac ttttttgttg acaaagataa aaacacatga tatacttaaa tcactcgaga 60 
gttagtttgt ttattaaatt aaccaact 88 

<210> 35 
<211> 88 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PL10 fusion sequence 
<400> 35 

tttcgtgaac ttttttgttg acaaagataa aaacacatga tataattaaa tcactcgaga 60 
gttagtttgt ttattaaatt aaccaact 88 

<210> 36 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xyl-T5 promoter sequence 
<400> 36 

tcataaaaaa tttatttgct ttcaggaaaa tttttctgta taatagattc a 51 

<210> 37 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xyl-T5-DD promoter sequence 
<400> 37 

tcataaaaaa tttattttac atcaggaaaa tttttctgta tattagattc a 51 

<210> 38 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CP25 promoter sequence 

<400> 38 

ctttggcagt ttattcttga catgttagtg agggggctgg tataatcaca tagtactgtt 60 



<210> 39 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P32 promoter sequence 
<400> 39 
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gtgagcttgg actagaaaaa aacttcacaa aatgctatac taggtaggta a 51 

<210> 40 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P59 promoter sequence 
<400> 40 

acattaaatt cttgacaggg agagataggt ttgatagaat ataatagttg t 51 

<210> 41 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P59* promoter sequence 
<400> 41 

aaaaatgaca gtttattctt gacagggaga gataggtttg atataatata atagttgt 58 

<210> 42 
<211> 115 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P1P2 promoter sequence 
<400> 42 

gtctagttat tcttgacaaa aaataatatt ttgatataat taaatagttg tcgtttgata 60 
acacagttta ttcttgacaa aaaaatataa aagtgtataa tagaaaagta ctgtt 115 

<210> 43 
<211> 207 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P1P2* promoter sequence 
<400> 43 

aaacagccga ataaggctgt ttttcttttt tttatgtttt agaataagtg gtctagttta 60 
ttcttgacaa aaaataatat tttgatataa ttaaatagtt gtcgtttgag acgactgact 120 
ttcttattat tcatctaaaa tattattttg aaaagataac acagttattc ttgacaaaaa 180 
aatataaaag tgtataatag aaaagta 207 

<210> 44 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PL promoter sequence 
<400> 44 

ttttttgttg acaaagataa aaacacatga tatacttaaa tea 4 3 
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<210> 45 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PL10 promoter sequence 
<400> 45 

tttcgtgaac ttttttgttg acaaagataa aaacacatga tataattaaa tea 53 

<210> 46 
<211> 9425 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pEPEF9 
<400> 46 

acccagtcag ctccttccgg tgggcgcggg gcatgactat cgtcgccgca cttatgactg 60 
tcttctttat catgcaactc gtaggacagg tgccggcagc gctctgggtc atttteggeg 120 
aggaccgett tegctggage gcgacgatga tcggcctgtc gettgeggta tteggaatet 180 
tgcacgccct cgctcaagcc ttcgtcactg gtcccgccac caaaegttte ggegagaage 240 
aggecattat cgccggcatg gcggccgacg cgctgggcta cgtcttgctg gcgttcgcga 300 
cgcgaggctg gatggectte cccattatga ttcttctcgc ttccggcggc ategggatge 360 
ccgcgttgca ggccatgctg tccaggcagg tagatgacga ccatcaggga cagcttcaag 420 
gatcgctcgc ggctcttacc agectaaett cgatcactgg accgctgatc gtcaeggega 480 
tttatgeege ctcggcgagc acatggaacg ggttggcatg gattgtaggc gccgccctat 540 
accttgtctg cctccccgcg ttgcgtcgcg gtgcatggag ccgggccacc tcgacctgaa 600 
tggaagcegg cggcacctcg etaaeggatt caccactcca agaattggag ccaatcaatt 660 
ettgeggaga actgtgaatg cgcaaaccaa cccttggcag aacatatcca tcgcgtccgc 720 
catctccagc agccgcacgc ggcgcatctc gggcagcgtt gggtcctggc cacgggtgcg 780 
catgategtg ctcctgtcgt tgaggacccg gctaggctgg eggggttgee ttactggtta 840 
gcagaatgaa tcaccgatac gegagegaac gtgaagegae tgetgetgea aaacgtctgc 900 
gacctgagca acaacatgaa tggtcttegg tttccgtgtt tegtaaagtc tggaaacgcg 960 
gaagtcccct aegtgetget gaagttgccc gcaacagaga gtggaaccaa ccggtgatac 1020 
cacgatacta tgactgagag tcaacgccat gagcggcctc atttcttatt ctgagttaca 1080 
acagtccgca ccgctgtccg gtagctcctt ccggtgggcg eggggcatga ctatcgtcgc 1140 
cgcacttatg actgtcttct ttatcatgea actegtagga caggtgeegg cagcgcccaa 1200 
cagtcccccg gccacggggc ctgccaccat acccacgccg aaacaagege cctgcaccat 1260 
tatgttccgg atetgeateg caggatgetg ctggctaccc tgtggaacac ctacatctgt 1320 
attaacgaag cgctaaccgt ttttatcagg ctctgggagg cagaataaat gatcatatcg 1380 
tcaattatta cctccacggg gagagectga gcaaactggc ctcaggcatt tgagaagcac 14 40 
acggtcacac tgcttccggt agtcaataaa ccggtaaacc agcaatagac ataagegget 1500 
atttaacgac cctgccctga accgacgacc gggtcgaatt tgctttcgaa tttctgecat 1560 
tcatccgctt attatcactt attcaggegt agcaccaggc gtttaagggc accaataact 1620 
gecttaaaaa aattacgccc cgccctgcca ctcatcgcag tactgttgta attcattaag 1680 
cattctgccg acatggaagc catcacagac ggcatgatga acctgaatcg ccagcggcat 1740 
cagcaccttg tcgccttgcg tataatattt gcccatggtg aaaacggggg cgaagaagtt 1800 
gtccatattg gecaegttta aatcaaaact ggtgaaactc acccagggat tggctgagac 1860 
gaaaaacata ttctcaataa accctttagg gaaataggee aggttttcac cgtaacacgc 1920 
cacatcttgc gaatatatgt gtagaaactg ceggaaateg tcgtggtatt cactccagag 1980 
cgatgaaaac gtttcagttt gctcatggaa aacggtgtaa caagggtgaa cactatccca 2040 
tatcaccagc tcaccgtctt teattgecat acgaattcag aggtttgatg actttgacct 2100 
ttggttttaa atttttgaaa aaaataaaaa aaggegaage ctattatata tttatcatat 2160 
atattttaat cttttgttct tttgcgtgaa aaaaaaggca gtgttttcgc tagttataga 2220 
aattaaacag tcacaaaaat cgatgtatac agtcacaaaa atcgattttt gtgactctat 2280 
gcatcgatgt atacgagtca caaaaatcga tgtacacagt acgacttttg tatttgtgta 2340 
ctgtatatag tataataaaa gcatagagaa aactcactat gaaatgactt tetctatget 2400 
actactaaaa caegcaaagg agegtattta tactatgatt ataccagaaa aacaaaacaa 24 60 
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acaaaagcag gtgctgacct tgaatgaatt agaaaaacgc aaagtagtgg agcataacgc 2520 
tttaattcaa agtgtggcta aaatgcaaaa aacagcttta aaaatgtttg agttagccgt 2580 
gtcttgtatt gataccgaag aacccccaaa aaataatact gtttatcttt taaaatctga 2640 
actttttaaa ttttttgaag tatcgtctag cagtaagcat agtcaattca aagaagcagt 2700 
taattatatg caaaagcaag cctttttcaa tattaaagct gataagaaac taggtattga 2760 
gtatgaaagc attgtgccta ttccttatgt aaaatggaac gattataacg atgaagtaac 2820 
aatacgtttt gaccaagcca ttatgcccta tctgattgat ttaaaggcag aatttactca 2880 
atataaaatt tccgaactac aaaaactcaa tagcaagtat tcgattattt tgtatcgttg 2940 
gttatccatg aattataacc aatacgaaca ttacagcgtg aaagggggac ggagagttga 3000 
acaagtggaa tcctaccgca acccctcaat aaaagtgaaa gaattgcgag aaataactga 3060 
tacaataaat gaacatcaac attttcccca ttttgaaact agagtattaa aaaaagcaat 3120 
tgaagaaatc aacgctcaca cctcttttaa tgtaacctat gagaaaaaga aagcagggcg 3180 
gagcattgac tctattgtct ttcatattga gaagaaacgc atggcagacg ataacagtta 3240 
caagttggga gataaagact atcaagacga taaaaagcaa aaatcaagaa atgaagctga 3300 
cttactaaaa caggcaatgg aaagtaaata tacaagactt ttatctgaaa atttcttaat 3360 
tggcatgaat gatattatgg atacagccac aatggtaggc ttacagaaga atgtttatcc 34 20 
gctttatgac gaactgaaag aattaagagg gctaaatggt gtcaaagacc acttgtctta 3480 
tgttgccagc aaaaaagaag cctattctaa acgtaatgta gcgaaatatc ttaaaaaagc 3540 
cattgaacaa tatctgccaa cggttaaaag gcaggactta aaccatgagt gaagacttaa 3600 
aaacgataaa agagttggcg gacgagctag gactagctaa acagaccgtt caataccatt 3660 
taaagttttt gccgaccgaa aatcgacaaa aaaacaatag tggttcgatc gtcctaaacc 3720 
ttgaagaaca aaagtttatc aaaaaaagac taacaaaaat aaaaaaagtt caaccgaccg 3780 
aaaatcgaca agaaaaaatg aagaattaag cgttaatagt cttttgttag gaattccgga 3840 
tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt gtgcttattt 3900 
ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt ataggtacat 3960 
tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga tatatcaacg 4020 
gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga aaatctcgat 4080 
aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt ggaacctctt 4140 
acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc ggtatcaaca 4200 
gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat ttattcggcg 4260 
caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt gtttttgagg 4320 
tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg acggggtggt 43B0 
gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact ggcttactat 4 4 40 
gttggcactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa aaggctgcac 4500 
cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc actgactcgc 4560 
tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc ggagatttcc 4 620 
tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa agccgttttt 4 680 
ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc agtggtggcg 4740 
aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc tcgtgcgctc 4 800 
tcctgttcct gcctttcggt ttaccggtgt cattccgctg ' ttatggccgc gtttgtctca 4860 
ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac tgtatgcacg 4920 
aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc 4 980 
cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt agaggagtta 5040 
gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg tgactgcgct 5100 
cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt cgaaaaaccg 5160 
ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc aaaacgatct 5220 
caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca atttatctct 5280 
tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc atgtttgaca 5340 
gcttatcatc gatacaaatt ccccgtaggc gctagggacc tctttagctt cttggaagct 5400 
gtcagtagta tatctaataa tttatctcca ttccctttag taacgtgtaa ctttccaaat 5460 
ttaaaaaagc gactcataga attatttcct cccgttaaat aatagataac tattaaaaat 5520 
agacaatact tgctcataag taatggtact taaattgttt actttggcgt gtttcattgc 5580 
ttgatgaaac tgatttttag taaacagttg acgatattct cgattgaccc attttgaaac 5640 
aaagtacgta tatagcttcc aatatttatc tggaacatct gtggtatggc gggtaagttt 5700 
tattaagaca ctgtttactt ttggtttagg atgaaagcat tccgctggca gcttaagcaa 5760 
ttgctgaatc gagacttgag tgtgcaagag caaccctagt gttcggtgaa tatccaaggt 5820 
acgcttgtag aatccttctt caacaatcag atagatgtca gacgcacggc tttcaaaaac 5880 
cactttttta ataatttgtg tgcttaaatg gtaaggaata ttcccaacaa ttttatacct 5940 
ctgtttgtta gggaattgaa actgtagaat atcttggtga attaaagtga cacgagtatt 6000 
cagttttaat ttttctgacg ataagttgaa tagatgactg tctaattcaa tagacgttac 6060 
ctgtttactt attttagcca gtttcgtcgt taaatgccct ttacctgttc caatttcgta 6120 
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aacggtatcg gtttctttta aattcaattg 
cgttaaaaag ttttgagaat attttatatt 
acatctataa acaaatacag aagttaaacg 
aaagtcataa gattagtcac tggtaggaat 
cactgttttt agtctgtttc aaaacagtag 
ccaacatgac gaatccctcc ttcttaatta 
tcctattata caaaatttta agataatgca 
gtcttatttc cataacttct tttacgtttc 
tatggtgcaa gtcagcacga acacgaaccg 
tgattggtgt atcatttcgt ttttcttttt 
tctttctatt tcttcttatt cttattttat 
tgataagtga tttgtgataa gtgatttgtg 
ttttatgttt attttgagaa agatataaaa 
acggcattgt agggcttttt ctctttgttt 
ctccagtaat tcttttacat tcgttttggt 
agtaaactta cattcaggat aatttgaaca 
gttgccacac ttaggacatt ttcctacaat 
ctggtaatca gaaaaattta gtttttctat 
aatgaatttt ttgatattcg taataaagtt 
ttttaaatac gtttcccatt tagccgtcat 
aactgcctgg cacaataatt ttcctttttc 
ataaccacat aacagtcata aaaccactcc 
ttggcattct acgactataa cttaaattta 
atagaataat gttgctcata tcgtttgcca 
acttggtaat attaaccgaa cttcccctgt 
aaaataattt tatagtaaat gaaaatgaga 
gcaatatttc acgaacacaa atatcgaaga 
acattctgaa caacttaaaa cacaagagtt 
ctaaaagtgg tggacgaaag cctattttac 
tttctatgga tttaacatat gattccgttg 
tattaaagca agattcctac gaattaaatg 
taaaatctaa tataaacgtc tcagaaaaat 
ctatatccat acacggtatc gttgacgatg 
aaaatgagaa acgcacattt accgatgaat 
tagaaaatga agcaaattta tcagcgctat 
taaataattt gattacttta agtattcaca 
aaaaacttta tcgtggctca aatggagagg 
aatctataaa taacaatgac aacaaatatt 
ctttaataca gaaaataaat aataggttgg 
aatattacaa cgaaggaaat tcaattgttg 
tgacagttct gattcataat ttgaatacac 
gtcctttaat taatgaatta ccaaatattt 
tttctcaagg cagtccaatt caattacatt 
tgggtggcac tttagcaata atgcaaaaaa 
ttaaataatt acagcagtct gagttataaa 
agaaaaaaac ttcacaaaat gctatactag 
aaattaacca actggacgaa ttcccgggga 
tcagcctgat acagattaaa tcagaacgca 
gcggcagtag cgcggtggtc ccacctgacc 
gcgccgatgg tagtgtgggg tctccccatg 
aaacgaaagg ctcagtcgaa agactgggcc 
gctctcctga gtaggacaaa tccgccggga 
ggagggtggc gggcaggacg cccgccataa 
atcctgacgg atggcctttt tgcgtttcta 
attcaaatat gtatccgctg atccccatcc 
cttca 



ttttattatt tggttgagta ctttttcact 6180 
tttgttcatg taatctctcc tgaagtgatt 6240 
atttgtttgt aattttagtt atctgtttaa 6300 
taatctaacg tatttattta tctgcgtaat 6360 
atgttttatc tacattacgc atttggaata 6420 
caaattttta gcatctaatt taacttcaat 6480 
ctatcaacac actcttaagt ttgcttctaa 6540 
cgccattctt tgctgtttcg atttttatga 6600 
tcttatctcc cattatatct ttttttgcac 6660 
gtgcgcctaa atttcccaca atcactcact 6720 
catcaacaat cacaaatcac ttgtgatttg 6780 
attaatatat aaaagccctc tttaaagggc 6840 
tcaatatatc ccttttcccc aatttttaca 6900 
ttgatttcct tttaccaggg tttcttttcc 6960 
tagttttttc tttctaaaat gttcagctaa 7020 
accataaaac gattttttta atacaatatt 7080 
acttttttct gcttcttttt ctttctgttc 7140 
atcgttaggt acagcttcca gtaaatgaac 7200 
ctcttgattg ccttctcttt taccgatttt 7260 
ttcagcactc gttaaaaggt gctgactttc 7320 
agttacaaca agcttgcctg caggtcgacg 7380 
tttttaacaa actttatcac aagaaatatt 7440 
tattttttac tttataatat ataattgatt 7500 
acatctagta ctcaaattac actatgttac 7560 
ccaaattaga taagaggtaa taataaatgg 7620 
agcgtgtatt aaaacaaatt ttcaataaca 7680 
atttagaact taataaagct actatttcta 7740 
tagttaatga agtaggagaa ggtaatagta 7800 
tcgaaattaa ccaaaaatat ggctactata 7860 
aattaatgta caactacttt gatgctacta 7 920 
ataaaaatgt aagcagtata ttacaaattt 7980 
atgatacgtt atatgggtta cttggtatat 8040 
agcaaaacat aatcaatctt ccttttcata 8100 
taaagtcatt cacaaatgtt cctgtcgtta 8160 
atgaaaaaag tttatatatt aattcaaaca 8220 
agggtatagg cgctggcatc ataataaata 8280 
ctggagagat aggtaagaca ttggttttgg 8340 
ataaaatcga agatatatgc tcccaagacg 8400 
gcgtcacatt gacgtttaca gaactaatcc 8460 
ctcatgaaat taaacaattt attaataaaa 8520 
aatttaaccc agacgctatt tatattaact 8580 
taaatgaaat taaagagcaa ttctcctgtt 8640 
taactactaa tgtaaaacaa gctactttat 8700 
cattaaatat aaataacatt caaatgaata 8760 
atagatatct cggaccggtg agcttggact 8820 
gtaggtaact cgagagttag tttgtttatt 8880 
tccgtcgacc tgcagccaag cttgggcttt 8 940 
gaagcggtct gataaaacag aatttgcctg 9000 
ccatgccgaa ctcagaagtg aaacgccgta 9060 
cgagagtagg gaactgccag gcatcaaata 9120 
tttcgtttta tctgttgttt gtcggtgaac 9180 
gcggatttga acgttgcgaa gcaacggccc 9240 
actgccaggc atcaaattaa gcagaaggcc 9300 
caaactcttt ttgtttattt ttctaaatac 9360 
tatcgatccc cacccgatgc ccttgagagc 94 20 

9425 



<210> 47 
<211> 12733 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pEPEF14 
<400> 47 

aagcttgcct gcaggtcgac gataaccaca taacagtcat aaaaccactc ctttttaaca 60 

aactttatca caagaaatat tttggcattc tacgactata acttaaattt atatttttta 120 

ctttataata tataattgat tatagaataa tgttgctcat atcgtttgcc aacatctagt 180 

actcaaatta cactatgtta cacttggtaa tattaaccga acttcccctg tccaaattag 240 

ataagaggta ataataaatg gaaaataatt ttatagtaaa tgaaaatgag aagcgtgtat 300 

taaaacaaat tttcaataac agcaatattt cacgaacaca aatatcgaag aatttagaac 360 

ttaataaagc tactatttct aacattctga acaacttaaa acacaagagt ttagttaatg 420 

aagtaggaga aggtaatagt actaaaagtg gtggacgaaa gcctatttta ctcgaaatta 480 

accaaaaata tggctactat atttctatgg atttaacata tgattccgtt gaattaatgt 540 

acaactactt tgatgctact atattaaagc aagattccta cgaattaaat gataaaaatg 600 

taagcagtat attacaaatt ttaaaatcta atataaacgt ctcagaaaaa tatgatacgt 660 

tatatgggtt acttggtata tctatatcca tacacggtat cgttgacgat gagcaaaaca 720 

taatcaatct tccttttcat aaaaatgaga aacgcacatt taccgatgaa ttaaagtcat 780 

tcacaaatgt tcctgtcgtt atagaaaatg aagcaaattt atcagcgcta tatgaaaaaa 840 

gtttatatat taattcaaac ataaataatt tgattacttt aagtattcac aagggtatag 900 

gcgctggcat cataataaat aaaaaacttt atcgtggctc aaatggagag gctggagaga 960 

taggtaagac attggttttg gaatctataa ataacaatga caacaaatat tataaaatcg 1020 

aagatatatg ctcccaagac gctttaatac agaaaataaa taataggttg ggcgtcacat 1080 

tgacgtttac agaactaatc caatattaca acgaaggaaa ttcaattgtt gctcatgaaa 1140 

ttaaacaatt tattaataaa atgacagttc tgattcataa tttgaataca caatttaacc 1200 

cagacgctat ttatattaac tgtcctttaa ttaatgaatt accaaatatt ttaaatgaaa 1260 

ttaaagagca attctcctgt ttttctcaag gcagtccaat tcaattacat ttaactacta 1320 

atgtaaaaca agctacttta ttgggtggca ctttagcaat aatgcaaaaa acattaaata 1380 

taaataacat tcaaatgaat attaaataat tacagcagtc tgagttataa aatagatatc 1440 

tcggaccgac attaaattct tgacagggag agataggttt gatagaatat aatagttgtc 1500 

tcgagagtta gtttgtttat taaattaacc aactggatcc cgggtctaga ttagggtaac 1560 

tttgaaagga tattcctcat gcaagctaat cttcaatggt tagatgaccc agaagtcttc 1620 

cgggtcaacc aattacctgc acatagtgat caccattatt atcacgacac agcagaattc 1680 

aaaacgggta gtcgcttcat caagagtctc aatggcgctt ggcgttttaa cttcgccaag 1740 

acaccggctg aacgcccagt tgatttttat caacccgatt tcgatgcaac cgactttgat 1800 

acgattcaag ttcccggtca tattgaacta gccggctatg gtcaaattca atacattaac 1860 

acgctatacc catgggaagg taaaatttac cgtcgcccac cgtataccct caatcaagat 1920 

caattaacac caggcctatt cagcgacgct gcggacaaca ccgtcggctc gtacctcaaa 1980 

accttcgatc tcgacgatgt ttttaaaggg caacgtatta tcattcagtt ccaaggggta 2040 

gaagaagccc tgtacgtctg gttaaatggc cattttattg gctactctga agatagtttc 2100 

accccttcag aatttgattt gacgccgtat attcaggacc aaggtaacgt tttagcggtt 2160 

cgggtctaca aacacagtac tgctgccttt attgaagasc caagatatgt tccgtttctc 2220 

tggtattttc cgtgacgtca atatactggc ggagcctgct agccatatta ctgatttgga 2280 

catccgacca gttccaaatg ccaatctcaa aagtggtgag ctcaacatca ctactaaagt 2340 

aaccggcgaa ccagccactt tagcgctgac cgttaaagac catgacgggc gagtactgac 2400 

gagtcaaacg caaaccggta gtggcagtgt aacctttgat actatgttat tcgaccaact 2460 

gcacttgtgg tcaccacaaa cgccgtatct ctatcaattg acaattgaag tttacgatgc 2520 

tgatcaccaa ctcttggaag tcgtcccata tcagtttggg ttccggacgg tcgagctgcg 2580 

cgatgacaaa gtcatttacg tcaacaataa acggttggtg atcaacgggg ttaaccggca 2 640 

cgaatggaac gcccacaccg gtcgcgttat cagtatggct gatatgcgcg ctgatatcca 2700 

aaccatgtta gctaacaata tcaatgccga tcggacctgc cattatcctg atcaattacc 2760 

ttggtatcaa ttatgtgacg aggccggtat ctacctaatg gccgaaacca acctcgaatc 2820 

gcacgggtca tggcaaaaga tgggggctat cgagccttct tacaatgttc ctggcgataa 2880 

tccacactgg ccagcagcgg tgatcgaccg ggcccgttca aactacgaat ggtttaaaaa 2940 

ccacccatca atcatttttt ggtcacttgg caatgaatcg tatgctggcg aagatatcgc 3000 

ggcgatgcag gctttttata aagaacacga tgattcacga ctcgtccact acgaaggcgt 3060 

tttctacaca ccagaattaa aagatcgcat ttctgatgtt gaaagtcgga tgtacgaaaa 3120 

gccccaaaat attgtagctt acttggaaga taacccaacc aaacctttcc taaattgtga 3180 

atatatgcat gacatgggga attctctggg cggtatgcaa tcatataatg atttgatcga 3240 
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caagtatcca atgtatcaag gtggctttat 
tcatgaccca atcaccgacc aagacgtgct 
ctccgattat gcattctccg gtaacggctt 
aatgcaagag gtgaaatatt attatggctt 
gcgacgggag cttaggacta cagggggcta 
ggggacttga atcactcgtc gtcaacgata 
tgttttggcg ggcgacaacc gataatgatc 
agtggtacgc ggccgataag ttctcaactt 
aaccagtcac accgttacca atcgcgccac 
ccacgaaagt ctctctggct taccacttcg 
cagtgactta tacggtgaca gcagacggtc 
agtctgattt gccagagcta cccgcatttg 
ccggcttcga ctataccggt ttgtccggtg 
cacacgggca attccacgtt gacagtctgc 
gcggcatgca catgcaaact gaacaagtga 
ctgaccacga caacacacca ttcagtttaa 
tcagctgcct tccctatacc gctgctgaac 
cattagcacg gcgaacggtc ttatcaatct 
acagttgggg aacggacgta gaagccccat 
tcagctttaa cattcatttc taaaaattaa 
atttgccaga gcgttctttt aatatgtttt 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 

gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 

gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
gggggggggg gggggggggg gggggggggg 
ggggggggtc gaccgatgcc cttgagagcc 
cggggcatga ctatcgtcgc cgcacttatg 
caggtgccgg cagcgctctg ggtcattttc 
atgatcggcc tgtcgcttgc ggtattcgga 
actggtcccg ccaccaaacg tttcggcgag 
gacgcgctgg gctacgtctt gctggcgttc 
atgattcttc tcgcttccgg cggcatcggg 
caggtagatg acgaccatca gggacagctt 
acttcgatca ctggaccgct gatcgtcacg 
aacgggttgg catggattgt aggcgccgcc 
cgcggtgcat ggagccgggc cacctcgacc 
gattcaccac tccaagaatt ggagccaatc 
ccaacccttg gcagaacata tccatcgcgt 
tctcgggcag cgttgggtcc tggccacggg 
cccggctagg ctggcggggt tgccttactg 
gaacgtgaag cgactgctgc tgcaaaacgt 
tcggtttccg tgtttcgtaa agtctggaaa 
gcccgcaaca gagagtggaa ccaaccggtg 
ccatgagcgg cctcatttct tattctgagt 
ccttccggtg ggcgcggggc atgactatcg 
tgcaactcgt aggacaggtg ccggcagcgc 
ccatacccac gccgaaacaa gcgccctgca 
gctgctggct accctgtgga acacctacat 
caggctctgg gaggcagaat aaatgatcat 
ctgagcaaac tggcctcagg catttgagaa 
taaaccggta aaccagcaat agacataagc 



ttgggacttt attgatcaag cactcttcgt 3300 
ccggtatggc ggtgatttcg acgaacgcca 3360 
aatgtttgcc gaccggacac caaaaccagc 3420 
acacaaataa tcaactacac gttatttacg 3480 
atttccacta cctctttagc tacgaacgtg 3540 
aagagtggct ctatcgtaca cccacgccca 3600 
acggtagcgg cttttcagtc aaatccgcac 3660 
gtcaagatat cgaattgacg gttgacgacc 3720 
tcaataacaa atacacggat cacgaaatcg 3780 
ttaccacgac cgttcctagt accatcgtca 3840 
agatcaatat cgccacccat tatagcggtc 3900 
gtctgcggtt tatcatgcca actaccgcga 3960 
agacttatcc tgaccggctg gctggcgcaa 4020 
cagtcacacc atacttggtc ccacaagaat 4080 
cagtaacgcg atcaacaaca caaaataacg 4140 
catttagcca aaccgatgca ccattcgcct 4200 
tagaaaacgc aacacacatg gaagaattac 4260 
acggtgccgt tcgtggggtc ggtggcattg 4320 
atcatatcct cgctaatcaa gacattgact 4380 
tttgatttca aaagaatcgc tccggcagtt 4440 
atggctgagc tttagtcctt tgaagtgggg 4500 
gggggggggg gggggggggg gggggggggg 4560 
gggggggggg gggggggggg gggggggggg 4 620 
gggggggggg gggggggggg gggggggggg 4 680 
gggggggggg gggggggggg gggggggggg 4740 
gggggggggg gggggggggg gggggggggg 4 800 
gggggggggg gggggggggg gggggggggg 4860 
gggggggggg gggggggggg gggggggggg 4 920 
gggggggggg gggggggggg gggggggggg 4980 
gggggggggg gggggggggg gggggggggg 5040 
gggggggggg gggggggggg gggggggggg 5100 
gggggggggg gggggggggg gggggggggg 5160 
gggggggggg gggggggggg gggggggggg 5220 
gggggggggg gggggggggg gggggggggg 5280 
gggggggggg gggggggggg gggggggggg 5340 
ttcaacccag tcagctcctt ccggtgggcg 5400 
actgtcttct ttatcatgca actcgtagga 5460 
ggcgaggacc gctttcgctg gagcgcgacg 5520 
atcttgcacg ccctcgctca agccttcgtc 5580 
aagcaggcca ttatcgccgg catggcggcc 5640 
gcgacgcgag gctggatggc cttccccatt 5700 
atgcccgcgt tgcaggccat gctgtccagg 5760 
caaggatcgc tcgcggctct taccagccta 5820 
gcgatttatg ccgcctcggc gagcacatgg 5880 
ctataccttg tctgcctccc cgcgttgcgt 5940 
tgaatggaag ccggcggcac ctcgctaacg 6000 
aattcttgcg gagaactgtg aatgcgcaaa 6060 
ccgccatctc cagcagccgc acgcggcgca 6120 
tgcgcatgat cgtgctcctg tcgttgagga 6180 
gttagcagaa tgaatcaccg atacgcgagc 6240 
ctgcgacctg agcaacaaca tgaatggtct 6300 
cgcggaagtc ccctacgtgc tgctgaagtt 6360 
ataccacgat actatgactg agagtcaacg 6420 
tacaacagtc cgcaccgctg tccggtagct 6480 
tcgccgcact tatgactgtc ttctttatca 6540 
ccaacagtcc cccggccacg gggcctgcca 6600 
ccattatgtt ccggatctgc atcgcaggat 6660 
ctgtattaac gaagcgctaa ccgtttttat 6720 
atcgtcaatt attacctcca cggggagagc 6780 
gcacacggtc acactgcttc cggtagtcaa 6840 
ggctatttaa cgaccctgcc ctgaaccgac 6900 
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gaccgggtcg aatttgcttt cgaatttctg 
gcgtagcacc aggcgtttaa gggcaccaat 
gccactcatc gcagtactgt tgtaattcat 
agacggcatg atgaacctga atcgccagcg 
atttgcccat ggtgaaaacg ggggcgaaga 
aactggtgaa actcacccag ggattggctg 
tagggaaata ggccaggttt tcaccgtaac 
actgccggaa atcgtcgtgg tattcactcc 
ggaaaacggt gtaacaaggg tgaacactat 
ccatacgaat tcagaggttt gatgactttg 
aaaaaaggcg aagcctatta tatatttatc 
tgaaaaaaaa ggcagtgttt tcgctagtta 
atacagtcac aaaaatcgat ttttgtgact 
tcgatgtaca cagtacgact tttgtatttg 
agaaaactca ctatgaaatg actttctcta 
tttatactat gattatacca gaaaaacaaa 
aattagaaaa acgcaaagta gtggagcata 
aaaaaacagc tttaaaaatg tttgagttag 
caaaaaataa tactgtttat cttttaaaat 
ctagcagtaa gcatagtcaa ttcaaagaag 
tcaatattaa agctgataag aaactaggta 
atgtaaaatg gaacgattat aacgatgaag 
cctatctgat tgatttaaag gcagaattta 
tcaatagcaa gtattcgatt attttgtatc 
aacattacag cgtgaaaggg ggacggagag 
caataaaagt gaaagaattg cgagaaataa 
cceattttga aactagagta ttaaaaaaag 
ttaatgtaac ctatgagaaa aagaaagcag 
ttgagaagaa acgcatggca gacgataaca 
acgataaaaa gcaaaaatca agaaatgaag 
aatatacaag acttttatct gaaaatttct 
ccacaatggt aggcttacag aagaatgttt 
gagggctaaa tggtgtcaaa gaccacttgt 
ctaaacgtaa tgtagcgaaa tatcttaaaa 
aaaggcagga cttaaaccat gagtgaagac 
ctaggactag ctaaacagac cgttcaatac 
caaaaaaaca atagtggttc gatcgtccta 
agactaacaa aaataaaaaa agttcaaccg 
taagcgttaa tagtcttttg ttaggaattc 
gtgaataaag gccggataaa acttgtgctt 
aatatccagc tgaacggtct ggttataggt 
atgttcttta cgatgccatt gggatatatc 
cattttagct tccttagctc ctgaaaatct 
tcttatttca ttatggtgaa agttggaacc 
ccaaaagttg gcccagggct tcccggtatc 
gaagtgatct tccgtcacag gtatttattc 
cttactgatt tagtgtatga tggtgttttt 
gctgtccctc ctgttcagct actgacgggg 
atcagcgcta gcggagtgta tactggctta 
agtgcttcat gtggcaggag aaaaaaggct 
aggatatatt ccgcttcctc gctcactgac 
gcggaaatgg cttacgaacg gggcggagat 
agggaagtga gagggccgcg gcaaagccgt 
atcacgaaat ctgacgctca aatcagtggt 
aggcgtttcc ccctggcggc tccctcgtgc 
gtgtcattcc gctgttatgg ccgcgtttgt 
taggcagttc gctccaagct ggactgtatg 
gccttatccg gtaactatcg tcttgagtcc 
gcagcagcca ctggtaattg atttagagga 
ctaaactgaa aggacaagtt ttggtgactg 
agagttggta gctcagagaa ccttcgaaaa 



ccattcatcc gcttattatc acttattcag 6960 

aactgcctta aaaaaattac gccccgccct 7020 

taagcattct gccgacatgg aagccatcac 7080 

gcatcagcac cttgtcgcct tgcgtataat 7140 

agttgtccat attggccacg tttaaatcaa 7200 

agacgaaaaa catattctca ataaaccctt 7260 

acgccacatc ttgcgaatat atgtgtagaa 7320 

agagcgatga aaacgtttca gtttgctcat 7380 

cccatatcac cagctcaccg tctttcattg 7440 

acctttggtt ttaaattttt gaaaaaaata 7500 

atatatattt taatcttttg ttcttttgcg 7560 

tagaaattaa acagtcacaa aaatcgatgt 7620 

ctatgcatcg atgtatacga gtcacaaaaa 7680 

tgtactgtat atagtataat aaaagcatag 7740 

tgctactact aaaacacgca aaggagcgta 7800 

acaaacaaaa gcaggtgctg accttgaatg 7860 

acgctttaat tcaaagtgtg gctaaaatgc 7 920 

ccgtgtcttg tattgatacc gaagaacccc 7980 

ctgaactttt taaatttttt gaagtatcgt 8040 

cagttaatta tatgcaaaag caagcctttt 8100 

ttgagtatga aagcattgtg cctattcctt 8160 

taacaatacg ttttgaccaa gccattatgc 8220 

ctcaatataa aatttccgaa ctacaaaaac 8280 

gttggttatc catgaattat aaccaatacg 8340 

ttgaacaagt ggaatcctac cgcaacccct 8400 

ctgatacaat aaatgaacat caacattttc 8460 

caattgaaga aatcaacgct cacacctctt 8520 

ggcggagcat tgactctatt gtctttcata 8580 

gttacaagtt gggagataaa gactatcaag 8640 

ctgacttact aaaacaggca atggaaagta 8700 

taattggcat gaatgatatt atggatacag 8760 

atccgcttta tgacgaactg aaagaattaa 8820 

cttatgttgc cagcaaaaaa gaagcctatt 8880 

aagccattga acaatatctg ccaacggtta 8 94 0 

ttaaaaacga taaaagagtt ggcggacgag 9000 

catttaaagt ttttgccgac cgaaaatcga 9060 

aaccttgaag aacaaaagtt tatcaaaaaa 9120 

accgaaaatc gacaagaaaa aatgaagaat 9180 

cggatgagca ttcatcaggc gggcaagaat 9240 

atttttcttt acggtcttta aaaaggccgt 9300 

acattgagca actgactgaa atgcctcaaa 9360 

aacggtggta tatccagtga tttttttctc 9420 

cgataactca aaaaatacgc ccggtagtga 9480 

tcttacgtgc cgatcaacgt ctcattttcg 9540 

aacagggaca ccaggattta tttattctgc 9600 

ggcgcaaagt gcgtcgggtg atgctgccaa 9660 

gaggtgctcc agtggcttct gtttctatca 9720 

tggtgcgtaa cggcaaaagc accgccggac 9780 

ctatgttggc actgatgagg gtgtcagtga 9840 

gcaccggtgc gtcagcagaa tatgtgatac 9900 

tcgctacgct cggtcgttcg actgcggcga 9960 

ttcctggaag atgccaggaa gatacttaac 10020 

ttttccatag gctccgcccc cctgacaagc 10080 

ggcgaaaccc gacaggacta taaagatacc 10140 

gctctcctgt tcctgccttt cggtttaccg 10200 

ctcattccac gcctgacact cagttccggg 10260 

cacgaacccc ccgttcagtc cgaccgctgc 10320 

aacccggaaa gacatgcaaa agcaccactg 10380 

gttagtcttg aagtcatgcg ccggttaagg 10440 

cgctcctcca agccagttac ctcggttcaa 10500 

accgccctgc aaggcggttt tttcgttttc 10560 
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agagcaagag attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat 10620 
aaaatatttc tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc 10680 
catacgatat aagttgtaat tctcatgttt gacagcttat catcgataca aattccccgt 10740 
aggcgctagg gacctcttta gcttcttgga agctgtcagt agtatatcta ataatttatc 10800 
tccattccct ttagtaacgt gtaactttcc aaatttaaaa aagcgactca tagaattatt 10860 
tcctcccgtt aaataataga taactattaa aaatagacaa tacttgctca taagtaatgg 10920 
tacttaaatt gtttactttg gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca 10980 
gttgacgata ttctcgattg acccattttg aaacaaagta cgtatatagc ttccaatatt 11040 
tatctggaac atctgtggta tggcgggtaa gttttattaa gacactgttt acttttggtt 11100 
taggatgaaa gcattccgct ggcagcttaa gcaattgctg aatcgagact tgagtgtgca 11160 
agagcaaccc tagtgttcgg tgaatatcca aggtacgctt gtagaatcct tcttcaacaa 11220 
tcagatagat gtcagacgca cggctttcaa aaaccacttt tttaataatt tgtgtgctta 11280 
aatggtaagg aatattccca acaattttat acctctgttt gttagggaat tgaaactgta 11340 
gaatatcttg gtgaattaaa gtgacacgag tattcagttt taatttttct gacgataagt 11400 
tgaatagatg actgtctaat tcaatagacg ttacctgttt acttatttta gccagtttcg 11460 
tcgttaaatg ccctttacct gttccaattt cgtaaacggt atcggtttct tttaaattca 11520 
attgttttat tatttggttg agtacttttt cactcgttaa aaagttttga gaatatttta 11580 
tatttttgtt catgtaatct ctcctgaagt gattacatct ataaacaaat acagaagtta 11640 
aacgatttgt ttgtaatttt agttatctgt ttaaaaagtc ataagattag tcactggtag 11700 
gaattaatct aacgtattta tttatctgcg taatcactgt ttttagtctg tttcaaaaca 11760 
gtagatgttt tatctacatt acgcatttgg aataccaaca tgacgaatcc ctccttctta 11820 
attacaaatt tttagcatct aatttaactt caattcctat tatacaaaat tttaagataa 11880 
tgcactatca acacactctt aagtttgctt ctaagtctta tttccataac ttcttttacg 11940 
tttccgccat tctttgctgt ttcgattttt atgatatggt gcaagtcagc acgaacacga 12000 
accgtcttat ctcccattat atcttttttt gcactgattg gtgtatcatt tcgtttttct 12060 
ttttgtgcgc ctaaatttcc cacaatcact cacttctttc tatttcttct tattcttatt 12120 
ttatcatcaa caatcacaaa tcacttgtga tttgtgataa gtgatttgtg ataagtgatt 12180 
tgtgattaat atataaaagc cctctttaaa gggcttttat gtttattttg agaaagatat 12240 
aaaatcaata tatccctttt ccccaatttt tacaacggca ttgtagggct ttttctcttt 12300 
gtttttgatt tccttttacc agggtttctt ttccctccag taattctttt acattcgttt 12360 
tggttagttt tttctttcta aaatgttcag ctaaagtaaa cttacattca ggataatttg 12420 
aacaaccata aaacgatttt tttaatacaa tattgttgcc acacttagga cattttccta 12480 
caatactttt ttctgcttct ttttctttct gttcctggta atcagaaaaa tttagttttt 12540 
ctatatcgtt aggtacagct tccagtaaat gaacaatgaa ttttttgata ttcgtaataa 12600 
agttctcttg attgccttct cttttaccga ttttttttaa atacgtttcc catttagccg 12660 
tcatttcagc actcgttaaa aggtgctgac tttcaactgc ctggcacaat aattttcctt 12720 
tttcagttac aac 12733 



<210> 48 
<211> 9417 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pEPEF14* 
<400> 48 

acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 
ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 
cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 
tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 24 0 
ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 
gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 
tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 
acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 
ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 
catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 
ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 
tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 
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gcagaacata tccatcgcgt ccgccatctc 
cgttgggtcc tggccacggg tgcgcatgat 
ctggcggggt tgccttactg gttagcagaa 
cgactgctgc tgcaaaacgt ctgcgacctg 
tgtttcgtaa agtctggaaa cgcggaagtc 
gagagtggaa ccaaccggtg ataccacgat 
cctcatttct tattctgagt tacaacagtc 
ggcgcggggc atgactatcg tcgccgcact 
aggacaggtg ccggcagcgc ccaacagtcc 
gccgaaacaa gcgccctgca ccattatgtt 
accctgtgga acacctacat ctgtattaac 
gaggcagaat aaatgatcat atcgtcaatt 
tggcctcagg catttgagaa gcacacggtc 
aaccagcaat agacataagc ggctatttaa 
aatttgcttt cgaatttctg ccattcatcc 
aggcgtttaa gggcaccaat aactgcctta 
gcagtactgt tgtaattcat taagcattct 
atgaacctga atcgccagcg gcatcagcac 
ggtgaaaacg ggggcgaaga agttgtccat 
actcacccag ggattggctg agacgaaaaa 
ggccaggttt tcaccgtaac acgccacatc 
atcgtcgtgg tattcactcc agagcgatga 
gtaacaaggg tgaacactat cccatatcac 
ggtttgatga ctttgacctt tggttttaaa 
tattatatat ttatcatata tattttaatc 
tgttttcgct agttatagaa attaaacagt 
tcgatttttg tgactctatg catcgatgta 
cgacttttgt atttgtgtac tgtatatagt 
aaatgacttt ctctatgcta ctactaaaac 
taccagaaaa acaaaacaaa caaaagcagg 
aagtagtgga gcataacgct ttaattcaaa 
aaatgtttga gttagccgtg tcttgtattg 
tttatctttt aaaatctgaa ctttttaaat 
gtcaattcaa agaagcagtt aattatatgc 
ataagaaact aggtattgag tatgaaagca 
attataacga tgaagtaaca atacgttttg 
taaaggcaga atttactcaa tataaaattt 
cgattatttt gtatcgttgg ttatccatga 
aagggggacg gagagttgaa caagtggaat 
aattgcgaga aataactgat acaataaatg 
gagtattaaa aaaagcaatt gaagaaatca 
agaaaaagaa agcagggcgg agcattgact 
tggcagacga taacagttac aagttgggag 
aatcaagaaa tgaagctgac ttactaaaac 
tatctgaaaa tttcttaatt ggcatgaatg 
tacagaagaa tgtttatccg ctttatgacg 
tcaaagacca cttgtcttat gttgccagca 
cgaaatatct taaaaaagcc attgaacaat 
accatgagtg aagacttaaa aacgataaaa 
cagaccgttc aataccattt aaagtttttg 
ggttcgatcg tcctaaacct tgaagaacaa 
aaaaaagttc aaccgaccga aaatcgacaa 
ttttgttagg cggatgagca ttcatcaggc 
acttgtgctt atttttcttt acggtcttta 
ggttataggt acattgagca actgactgaa 
gggatatatc aacggtggta tatccagtga 
ctgaaaatct cgataactca aaaaatacgc 
agttggaacc tcttacgtgc cgatcaacgt 
tcccggtatc aacagggaca ccaggattta 
gtatttattc ggcgcaaagt gcgtcgggtg 
tggtgttttt gaggtgctcc agtggcttct 



cagcagccgc acgcggcgca tctcgggcag 780 
cgtgctcctg tcgttgagga cccggctagg 840 
tgaatcaccg atacgcgagc gaacgtgaag 900 
agcaacaaca tgaatggtct tcggtttccg 960 
ccctacgtgc tgctgaagtt gcccgcaaca 1020 
actatgactg agagtcaacg ccatgagcgg 1080 
cgcaccgctg tccggtagct ccttccggtg 1140 
tatgactgtc ttctttatca tgcaactcgt 1200 
cccggccacg gggcctgcca ccatacccac 1260 
ccggatctgc atcgcaggat gctgctggct 1320 
gaagcgctaa ccgtttttat caggctctgg 1380 
attacctcca cggggagagc ctgagcaaac 1440 
acactgcttc cggtagtcaa taaaccggta 1500 
cgaccctgcc ctgaaccgac gaccgggtcg 1560 
gcttattatc acttattcag gcgtagcacc 1620 
aaaaaattac gccccgccct gccactcatc 1680 
gccgacatgg aagccatcac agacggcatg* 1740 
cttgtcgcct tgcgtataat atttgcccat 1800 
attggccacg tttaaatcaa aactggtgaa 18 60 
catattctca ataaaccctt tagggaaata 1920 
ttgcgaatat atgtgtagaa actgccggaa 1980 
aaacgtttca gtttgctcat ggaaaacggt 2040 
cagctcaccg tctttcattg ccatacgaga 2100 
tttttgaaaa aaataaaaaa aggcgaagcc 2160 
ttttgttctt ttgcgtgaaa aaaaaggcag 2220 
cacaaaaatc gatgtataca gtcacaaaaa 2280 
tacgagtcac aaaaatcgat gtacacagta 2340 
ataataaaag catagagaaa actcactatg 2400 
acgcaaagga gcgtatttat actatgatta 2460 
tgctgacctt gaatgaatta gaaaaacgca 2520 
gtgtggctaa aatgcaaaaa acagctttaa 2580 
ataccgaaga acccccaaaa aataatactg 2640 
tttttgaagt atcgtctagc agtaagcata 2700 
aaaagcaagc ctttttcaat attaaagctg 2760 
ttgtgcctat tccttatgta aaatggaacg 2820 
accaagccat tatgccctat ctgattgatt 2880 
ccgaactaca aaaactcaat agcaagtatt 2940 
attataacca atacgaacat tacagcgtga 3000 
cctaccgcaa cccctcaata aaagtgaaag 3060 
aacatcaaca ttttccccat tttgaaacta 3120 
acgctcacac ctcttttaat gtaacctatg 3180 
ctattgtctt tcatattgag aagaaacgca 3240 
ataaagacta tcaagacgat aaaaagcaaa 3300 
aggcaatgga aagtaaatat acaagacttt 3360 
atattatgga tacagccaca atggtaggct 3420 
aactgaaaga attaagaggg ctaaatggtg 3480 
aaaaagaagc ctattctaaa cgtaatgtag 3540 
atctgccaac ggttaaaagg caggacttaa 3600 
gagttggcgg acgagctagg actagctaaa 3660 
ccgaccgaaa atcgacaaaa aaacaatagt 3720 
aagtttatca aaaaaagact aacaaaaata 3780 
gaaaaaatga agaattaagc gttaatagtc 3840 
gggcaagaat gtgaataaag gccggataaa 3900 
aaaaggccgt aatatccagc tgaacggtct 3960 
atgcctcaaa atgttcttta cgatgccatt 4 020 
tttttttctc cattttagct tccttagctc 4080 
ccggtagtga tcttatttca ttatggtgaa 4140 
ctcattttcg ccaaaagttg gcccagggct 4 200 
tttattctgc gaagtgatct tccgtcacag 4260 
atgctgccaa cttactgatt tagtgtatga 4 320 
gtttctatca gctgtccctc ctgttcagct 4 380 
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actgacgggg tggtgcgtaa cggcaaaagc 
tactggctta ctatgttggc actgatgagg 
aaaaaaggct gcaccggtgc gtcagcagaa 
gctcactgac tcgctacgct cggtcgttcg 
gggcggagat ttcctggaag atgccaggaa 
gcaaagccgt ttttccatag gctccgcccc 
aatcagtggt ggcgaaaccc gacaggacta 
tccctcgtgc gctctcctgt tcctgccttt 
ccgcgtttgt ctcattccac gcctgacact 
ggactgtatg cacgaacccc ccgttcagtc 
tcttgagtcc aacccggaaa gacatgcaaa 
atttagagga gttagtcttg aagtcatgcg 
ttggtgactg cgctcctcca agccagttac 
ccttcgaaaa accgccctgc aaggcggttt 
gaccaaaacg atctcaagaa gatcatctta 
tgcaatttat ctcttcaaat gtagcacctg 
tctcatgttt gacagcttat catcgataca 
gcttcttgga agctgtcagt agtatatcta 
gtaactttcc aaatttaaaa aagcgactca 
taactattaa aaatagacaa tacttgctca 
gcgtgtttca ttgcttgatg aaactgattt 
acccattttg aaacaaagta cgtatatagc 
tggcgggtaa gttttattaa gacactgttt 
ggcagcttaa gcaattgctg aatcgagact 
tgaatatcca aggtacgctt gtagaatcct 
cggctttcaa aaaccacttt tttaataatt 
acaattttat acctctgttt gttagggaat 
gtgacacgag tattcagttt taatttttct 
tcaatagacg ttacctgttt acttatttta 
gttccaattt cgtaaacggt atcggtttct 
agtacttttt cactcgttaa aaagttttga 
ctcctgaagt gattacatct ataaacaaat 
agttatctgt ttaaaaagtc ataagattag 
tttatctgcg taatcactgt ttttagtctg 
acgcatttgg aataccaaca tgacgaatcc 
aatttaactt caattcctat tatacaaaat 
aagtttgctt ctaagtctta tttccataac 
ttcgattttt atgatatggt gcaagtcagc 
atcttttttt gcactgattg gtgtatcatt 
cacaatcact cacttctttc tatttcttct 
tcacttgtga tttgtgataa gtgatttgtg 
cctctttaaa gggcttttat gtttattttg 
ccccaatttt tacaacggca ttgtagggct 
agggtttctt ttccctccag taattctttt 
aaatgttcag ctaaagtaaa cttacattca 
tttaatacaa tattgttgcc acacttagga 
ttttctttct gttcctggta atcagaaaaa 
tccagtaaat gaacaatgaa ttttttgata 
cttttaccga ttttttttaa atacgtttcc 
aggtgctgac tttcaactgc ctggcacaat 
aggtcgacga taaccacata acagtcataa 
agaaatattt tggcattcta cgactataac 
taattgatta tagaataatg ttgctcatat 
ctatgttaca cttggtaata ttaaccgaac 
aataaatgga aaataatttt atagtaaatg 
tcaataacag caatatttca cgaacacaaa 
ctatttctaa cattctgaac aacttaaaac 
gtaatagtac taaaagtggt ggacgaaagc 
gctactatat ttctatggat ttaacatatg 
atgctactat attaaagcaa gattcctacg 
tacaaatttt aaaatctaat ataaacgtct 



accgccggac atcagcgcta gcggagtgta 4440 
gtgtcagtga agtgcttcat gtggcaggag 4500 
tatgtgatac aggatatatt ccgcttcctc 4560 
actgcggcga gcggaaatgg cttacgaacg 4 620 
gatacttaac agggaagtga gagggccgcg 4680 
cctgacaagc atcacgaaat ctgacgctca 4740 
taaagatacc aggcgtttcc ccctggcggc 4800 
cggtttaccg gtgtcattcc gctgttatgg 4860 
cagttccggg taggcagttc gctccaagct 4 920 
cgaccgctgc gccttatccg gtaactatcg 4 980 
agcaccactg gcagcagcca ctggtaattg 504 0 
ccggttaagg ctaaactgaa aggacaagtt 5100 
ctcggttcaa agagttggta gctcagagaa 5160 
tttcgttttc agagcaagag attacgcgca 5220 
ttaatcagat aaaatatttc tagatttcag 5280 
aagtcagccc catacgatat aagttgtaat 5340 
aattccccgt aggcgctagg gacctcttta 5400 
ataatttatc tccattccct ttagtaacgt 54 60 
tagaattatt tcctcccgtt aaataataga 5520 
taagtaatgg tacttaaatt gtttactttg 5580 
ttagtaaaca gttgacgata ttctcgattg 5640 
ttccaatatt tatctggaac atctgtggta 5700 
acttttggtt taggatgaaa gcattccgct 5760 
tgagtgtgca agagcaaccc tagtgttcgg 5820 
tcttcaacaa tcagatagat gtcagacgca 5880 
tgtgtgctta aatggtaagg aatattccca 5940 
tgaaactgta gaatatcttg gtgaattaaa 6000 
gacgataagt tgaatagatg actgtctaat 6060 
gccagtttcg tcgttaaatg ccctttacct 6120 
tttaaattca attgttttat tatttggttg 6180 
gaatatttta tatttttgtt catgtaatct 6240 
acagaagtta aacgatttgt ttgtaatttt 6300 
tcactggtag gaattaatct aacgtattta 6360 
tttcaaaaca gtagatgttt tatctacatt 6420 
ctccttctta attacaaatt tttagcatct 6480 
tttaagataa tgcactatca acacactctt 6540 
ttcttttacg tttccgccat tctttgctgt 6600 
acgaacacga accgtcttat ctcccattat 6660 
tcgtttttct ttttgtgcgc ctaaatttcc 6720 
tattcttatt ttatcatcaa caatcacaaa 6780 
ataagtgatt tgtgattaat atataaaagc 6840 
agaaagatat aaaatcaata tatccctttt 6900 
ttttctcttt gtttttgatt tccttttacc 6960 
acattcgttt tggttagttt tttctttcta 7020 
ggataatttg aacaaccata aaacgatttt 7080 
cattttccta caatactttt ttctgcttct 7140 
tttagttttt ctatatcgtt aggtacagct 7200 
ttcgtaataa agttctcttg attgccttct 7260 
catttagccg tcatttcagc actcgttaaa 7320 
aattttcctt tttcagttac aacagcctgc 7380 
aaccactcct ttttaacaaa ctttatcaca 7440 
ttaaatttat attttttact ttataatata 7500 
cgtttgccaa catctagtac tcaaattaca 7560 
ttcccctgtc caaattagat aagaggtaat 7620 
aaaatgagaa gcgtgtatta aaacaaattt 7680 
tatcgaagaa tttagaactt aataaagcta 7740 
acaagagttt agttaatgaa gtaggagaag 7800 
ctattttact cgaaattaac caaaaatatg 7860 
attccgttga attaatgtac aactactttg 7920 
aattaaatga taaaaatgta agcagtatat 7980 
cagaaaaata tgatacgtta tatgggttac 8040 
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ttggtatatc tatatccata cacggtatcg ttgacgatga gcaaaacata atcaatcttc 8100 

cttttcataa aaatgagaaa cgcacattta ccgatgaatt aaagtcattc acaaatgttc 8160 

ctgtcgttat agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt ttatatatta 8220 

attcaaacat aaataatttg attactttaa gtattcacaa gggtataggc gctggcatca 8280 

taataaataa aaaactttat cgtggctcaa atggagaggc tggagagata ggtaagacat 8340 

tggttttgga atctataaat aacaatgaca acaaatatta taaaatcgaa gatatatgct 8400 

cccaagacgc tttaatacag aaaataaata ataggttggg cgtcacattg acgtttacag 84 60 

aactaatcca atattacaac gaaggaaatt caattgttgc tcatgaaatt aaacaattta 8520 

ttaataaaat gacagttctg attcataatt tgaatacaca atttaaccca gacgctattt 8580 

atattaactg tcctttaatt aatgaattac caaatatttt aaatgaaatt aaagagcaat 8640 

tctcctgttt ttctcaaggc agtccaattc aattacattt aactactaat gtaaaacaag 8700 

ctactttatt gggtggcact ttagcaataa tgcaaaaaac attaaatata aataacattc 87 60 

aaatgaatat taaataatta cagcagtctg agttataaaa tagatatctc ggaccgaaaa 8820 

atgacagttt attcttgaca gggagagata ggtttgatat aatataatag ttgtctcgag 8880 

agttagtttg tttattaaat taaccaactg gacgaattcc cggggatccg tcgacctgca 8940 

gccaagcttg ggcttttcag cctgatacag attaaatcag aacgcagaag cggtctgata 9000 

aaacagaatt tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca 9060 

gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac 9120 

tgccaggcat caaataaaac gaaaggctca gtcgaaagac tgggcctttc gttttatctg 9180 

ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg ccgggagcgg atttgaacgt 9240 

tgcgaagcaa cggcccggag ggtggcgggc aggacgcccg ccataaactg ccaggcatca 9300 

aattaagcag aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctctttttgt 9360 

ttatttttct aaatacattc aaatatgtat ccgctgatcc ccatcctatc gatcccc 9417 



<210> 49 
<211> 9484 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector P1P2/EF14 
<400> 49 

acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 

ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 

cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 

tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 240 

ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 

gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 

tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 

acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 

ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 

catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 

ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 

tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 

gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca tctcgggcag 780 

cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga cccggctagg 840 

ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc gaacgtgaag 900 

cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 960 

tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt gcccgcaaca 1020 

gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg ccatgagcgg 1080 

cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct ccttccggtg 1140 

ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1200 

aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac 1260 

gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat gctgctggct 1320 

accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat caggctctgg 1380 

gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc ctgagcaaac 1440 

tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa taaaccggta 1500 

aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac gaccgggtcg 1560 
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aatttgcttt cgaatttctg ccattcatcc 
aggcgtttaa gggcaccaat aactgcctta 
gcagtactgt tgtaattcat taagcattct 
atgaacctga atcgccagcg gcatcagcac 
ggtgaaaacg ggggcgaaga agttgtccat 
actcacccag ggattggctg agacgaaaaa 
ggccaggttt tcaccgtaac acgccacatc 
atcgtcgtgg tattcactcc agagcgatga 
gtaacaaggg tgaacactat cccatatcac 
tcagaggttt gatgactttg acctttggtt 
aagcctatta tatatttatc atatatattt 
ggcagtgttt tcgctagtta tagaaattaa 
aaaaatcgat ttttgtgact ctatgcatcg 
cagtacgact tttgtatttg tgtactgtat 
ctatgaaatg actttctcta tgctactact 
gattatacca gaaaaacaaa acaaacaaaa 
acgcaaagta gtggagcata acgctttaat 
tttaaaaatg tttgagttag ccgtgtcttg 
tactgtttat cttttaaaat ctgaactttt 
gcatagtcaa ttcaaagaag cagttaatta 
agctgataag aaactaggta ttgagtatga 
gaacgattat aacgatgaag taacaatacg 
tgatttaaag gcagaattta ctcaatataa 
gtattcgatt attttgtatc gttggttatc 
cgtgaaaggg ggacggagag ttgaacaagt 
gaaagaattg cgagaaataa ctgatacaat 
aactagagta ttaaaaaaag caattgaaga 
ctatgagaaa aagaaagcag ggcggagcat 
acgcatggca gacgataaca gttacaagtt 
gcaaaaatca agaaatgaag ctgacttact 
acttttatct gaaaatttct taattggcat 
aggcttacag aagaatgttt atccgcttta 
tggtgtcaaa gaccacttgt cttatgttgc 
tgtagcgaaa tatcttaaaa aagccattga 
cttaaaccat gagtgaagac ttaaaaacga 
ctaaacagac cgttcaatac catttaaagt 
atagtggttc gatcgtccta aaccttgaag 
aaataaaaaa agttcaaccg accgaaaatc 
tagtcttttg ttaggaattc cggatgagca 
gccggataaa acttgtgctt atttttcttt 
tgaacggtct ggttataggt acattgagca 
cgatgccatt gggatatatc aacggtggta 
tccttagctc ctgaaaatct cgataactca 
ttatggtgaa agttggaacc tcttacgtgc 
gcccagggct tcccggtatc aacagggaca 
tccgtcacag gtatttattc ggcgcaaagt 
tagtgtatga tggtgttttt gaggtgctcc 
ctgttcagct actgacgggg tggtgcgtaa 
gcggagtgta tactggctta ctatgttggc 
gtggcaggag aaaaaaggct gcaccggtgc 
ccgcttcctc gctcactgac tcgctacgct 
cttacgaacg gggcggagat ttcctggaag 
gagggccgcg gcaaagccgt ttttccatag 
ctgacgctca aatcagtggt ggcgaaaccc 
ccctggcggc tccctcgtgc gctctcctgt 
gctgttatgg ccgcgtttgt ctcattccac 
gctccaagct ggactgtatg cacgaacccc 
gtaactatcg tcttgagtcc aacccggaaa 
ctggtaattg atttagagga gttagtcttg 
aggacaagtt ttggtgactg cgctcctcca 
gctcagagaa ccttcgaaaa accgccctgc 



gcttattatc acttattcag. gcgtagcacc 1620 
aaaaaattac gccccgccct gccactcatc 1680 
gccgacatgg aagccatcac agacggcatg 1740 
cttgtcgcct tgcgtataat atttgcccat 1800 
attggccacg tttaaatcaa aactggtgaa 1860 
catattctca ataaaccctt tagggaaata 1920 
ttgcgaatat atgtgtagaa actgccggaa 1980 
aaacgtttca gtttgctcat ggaaaacggt 2040 
cagctcaccg tctttcattg ccatacgaat 2100 
ttaaattttt gaaaaaaata aaaaaaggcg 2160 
taatcttttg ttcttttgcg tgaaaaaaaa 2220 
acagtcacaa aaatcgatgt atacagtcac 2280 
atgtatacga gtcacaaaaa tcgatgtaca 2340 
atagtataat aaaagcatag agaaaactca 2400 
aaaacacgca aaggagcgta tttatactat 24 60 
gcaggtgctg accttgaatg aattagaaaa 2520 
tcaaagtgtg gctaaaatgc aaaaaacagc 2580 
tattgatacc gaagaacccc caaaaaataa 2640 
taaatttttt gaagtatcgt ctagcagtaa 2700 
tatgcaaaag caagcctttt tcaatattaa 2760 
aagcattgtg cctattcctt atgtaaaatg 2820 
ttttgaccaa gccattatgc cctatctgat 2880 
aatttccgaa ctacaaaaac tcaatagcaa 2940 
catgaattat aaccaatacg aacattacag 3000 
ggaatcctac cgcaacccct caataaaagt 3060 
aaatgaacat caacattttc cccattttga 3120 
aatcaacgct cacacctctt ttaatgtaac 3180 
tgactctatt gtctttcata ttgagaagaa 3240 
gggagataaa gactatcaag acgataaaaa 3300 
aaaacaggca atggaaagta aatatacaag 3360 
gaatgatatt atggatacag ccacaatggt 3420 
tgacgaactg aaagaattaa gagggctaaa 3480 
cagcaaaaaa gaagcctatt ctaaacgtaa 3540 
acaatatctg ccaacggtta aaaggcagga 3600 
taaaagagtt ggcggacgag ctaggactag 3660 
ttttgccgac cgaaaatcga caaaaaaaca 3720 
aacaaaagtt tatcaaaaaa agactaacaa 3780 
gacaagaaaa aatgaagaat taagcgttaa 3840 
ttcatcaggc gggcaagaat gtgaataaag 3900 
acggtcttta aaaaggccgt aatatccagc 3960 
actgactgaa atgcctcaaa atgttcttta 4020 
tatccagtga tttttttctc cattttagct 4080 
aaaaatacgc ccggtagtga tcttatttca 4140 
cgatcaacgt ctcattttcg ccaaaagttg 4200 
ccaggattta tttattctgc gaagtgatct 4260 
gcgtcgggtg atgctgccaa cttactgatt 4320 
agtggcttct gtttctatca gctgtccctc 4380 
cggcaaaagc accgccggac atcagcgcta 4 4 40 
actgatgagg gtgtcagtga agtgcttcat 4500 
gtcagcagaa tatgtgatac aggatatatt 4560 
cggtcgttcg actgcggcga gcggaaatgg 4 620 
atgccaggaa gatacttaac agggaagtga 4 680 
gctccgcccc cctgacaagc atcacgaaat 4740 
gacaggacta taaagatacc aggcgtttcc 4800 
tcctgccttt cggtttaccg gtgtcattcc 4860 
gcctgacact cagttccggg taggcagttc 4 920 
ccgttcagtc cgaccgctgc gccttatccg 4 980 
gacatgcaaa agcaccactg gcagcagcca 5040 
aagtcatgcg ccggttaagg ctaaactgaa 5100 
agccagttac ctcggttcaa agagttggta 5160 
aaggcggttt tttcgttttc agagcaagag 5220 
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attacgcgca gaccaaaacg atctcaagaa 
tagatttcag tgcaatttat ctcttcaaat 
aagttgtaat tctcatgttt gacagcttat 
gacctcttta gcttcttgga agctgtcagt 
ttagtaacgt gtaactttcc aaatttaaaa 
aaataataga taactattaa aaatagacaa 
gtttactttg gcgtgtttca ttgcttgatg 
ttctcgattg acccattttg aaacaaagta 
atctgtggta tggcgggtaa gttttattaa 
gcattccgct ggcagcttaa gcaattgctg 
tagtgttcgg tgaatatcca aggtacgctt 
gtcagacgca cggctttcaa aaaccacttt 
aatattccca acaattttat acctctgttt 
gtgaattaaa gtgacacgag tattcagttt 
actgtctaat tcaatagacg ttacctgttt 
ccctttacct gttccaattt cgtaaacggt 
tatttggttg agtacttttt cactcgttaa 
catgtaatct ctcctgaagt gattacatct 
ttgtaatttt agttatctgt ttaaaaagtc 
aacgtattta tttatctgcg taatcactgt 
tatctacatt acgcatttgg aataccaaca 
tttagcatct aatttaactt caattcctat 
acacactctt aagtttgctt ctaagtctta 
tctttgctgt ttcgattttt atgatatggt 
ctcccattat atcttttttt gcactgattg 
ctaaatttcc cacaatcact cacttctttc 
caatcacaaa tcacttgtga tttgtgataa 
atataaaagc cctctttaaa gggcttttat 
tatccctttt ccccaatttt tacaacggca 
tccttttacc agggtttctt ttccctccag 
tttctttcta aaatgttcag ctaaagtaaa 
aaacgatttt tttaatacaa tattgttgcc 
ttctgcttct ttttctttct gttcctggta 
aggtacagct tccagtaaat gaacaatgaa 
attgccttct cttttaccga ttttttttaa 
actcgttaaa aggtgctgac tttcaactgc 
aacagcctgc aggtcgacga taaccacata 
ctttatcaca agaaatattt tggcattcta 
ttataatata taattgatta tagaataatg 
tcaaattaca ctatgttaca cttggtaata 
aagaggtaat aataaatgga aaataatttt 
aaacaaattt tcaataacag caatatttca 
aataaagcta ctatttctaa cattctgaac 
gtaggagaag gtaatagtac taaaagtggt 
caaaaatatg gctactatat ttctatggat 
aactactttg atgctactat attaaagcaa 
agcagtatat tacaaatttt aaaatctaat 
tatgggttac ttggtatatc tatatccata 
atcaatcttc cttttcataa aaatgagaaa 
acaaatgttc ctgtcgttat agaaaatgaa 
ttatatatta attcaaacat aaataatttg 
gctggcatca taataaataa aaaactttat 
ggtaagacat tggttttgga atctataaat 
gatatatgct cccaagacgc tttaatacag 
acgtttacag aactaatcca atattacaac 
aaacaattta ttaataaaat gacagttctg 
gacgctattt atattaactg tcctttaatt 
aaagagcaat tctcctgttt ttctcaaggc 
gtaaaacaag ctactttatt gggtggcact 
aataacattc aaatgaatat taaataatta 
ggaccggtct agttattctt gacaaaaaat 



gatcatctta ttaatcagat aaaatatttc 5280 
gtagcacctg aagtcagccc catacgatat 5340 
catcgataca aattccccgt aggcgctagg 5400 
agtatatcta ataatttatc tccattccct 54 60 
aagcgactca tagaattatt tcctcccgtt 5520 
tacttgctca taagtaatgg tacttaaatt 5580 
aaactgattt ttagtaaaca gttgacgata 5640 
cgtatatagc ttccaatatt tatctggaac 5700 
gacactgttt acttttggtt taggatgaaa 5760 
aatcgagact tgagtgtgca agagcaaccc 5820 
gtagaatcct tcttcaacaa tcagatagat 5880 
tttaataatt tgtgtgctta aatggtaagg 5940 
gttagggaat tgaaactgta gaatatcttg 6000 
taatttttct gacgataagt tgaatagatg 6060 
acttatttta gccagtttcg tcgttaaatg 6120 
atcggtttct tttaaattca attgttttat 6180 
aaagttttga gaatatttta tatttttgtt 6240 
ataaacaaat acagaagtta aacgatttgt 6300 
ataagattag tcactggtag gaattaatct 6360 
ttttagtctg tttcaaaaca gtagatgttt 6420 
tgacgaatcc ctccttctta attacaaatt 6480 
tatacaaaat tttaagataa tgcactatca 6540 
tttccataac ttcttttacg tttccgccat 6600 
gcaagtcagc acgaacacga accgtcttat 6660 
gtgtatcatt tcgtttttct ttttgtgcgc 6720 
tatttcttct tattcttatt ttatcatcaa 6780 
gtgatttgtg ataagtgatt tgtgattaat 6840 
gtttattttg agaaagatat aaaatcaata 6900 
ttgtagggct ttttctcttt gtttttgatt 6960 
taattctttt acattcgttt tggttagttt 7020 
cttacattca ggataatttg aacaaccata 7080 
acacttagga cattttccta caatactttt 7140 
atcagaaaaa tttagttttt ctatatcgtt 7200 
ttttttgata ttcgtaataa agttctcttg 7260 
atacgtttcc catttagccg tcatttcagc 7320 
ctggcacaat aattttcctt tttcagttac 7380 
acagtcataa aaccactcct ttttaacaaa 74 40 
cgactataac ttaaatttat attttttact 7500 
ttgctcatat cgtttgccaa catctagtac 7560 
ttaaccgaac ttcccctgtc caaattagat 7620 
atagtaaatg aaaatgagaa gcgtgtatta 7680 
cgaacacaaa tatcgaagaa tttagaactt 7740 
aacttaaaac acaagagttt agttaatgaa 7800 
ggacgaaagc ctattttact cgaaattaac 7860 
ttaacatatg attccgttga attaatgtac 7920 
gattcctacg aattaaatga taaaaatgta 7980 
ataaacgtct cagaaaaata tgatacgtta 8040 
cacggtatcg ttgacgatga gcaaaacata 8100 
cgcacattta ccgatgaatt aaagtcattc 8160 
gcaaatttat cagcgctata tgaaaaaagt 8220 
attactttaa gtattcacaa gggtataggc 8280 
cgtggctcaa atggagaggc tggagagata 8340 
aacaatgaca acaaatatta taaaatcgaa 84 00 
aaaataaata ataggttggg cgtcacattg 84 60 
gaaggaaatt caattgttgc tcatgaaatt 8520 
attcataatt tgaatacaca atttaaccca 8580 
aatgaattac caaatatttt aaatgaaatt 8640 
agtccaattc aattacattt aactactaat 8700 
ttagcaataa tgcaaaaaac attaaatata 8760 
cagcagtctg agttataaaa tagatatctc 8820 
aatattttga tataattaaa tagttgtcgt 8880 
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ttgataacac agtttattct tgacaaaaaa 
tctcgagagt tagtttgttt attaaattaa 
acctgcagcc aagcttgggc ttttcagcct 
tctgataaaa cagaatttgc ctggcggcag 
gaactcagaa gtgaaacgcc gtagcgccga 
agggaactgc caggcatcaa ataaaacgaa 
ttatctgttg tttgtcggtg aacgctctcc 
tgaacgttgc gaagcaacgg cccggagggt 
ggcatcaaat taagcagaag gccatcctga 
tttttgttta tttttctaaa tacattcaaa 
cccc 



atataaaagt gtataataga aaagtactgt 8940 

ccaactggac gaattcccgg ggatccgtcg 9000 

gatacagatt aaatcagaac gcagaagcgg 9060 

tagcgcggtg gtcccacctg accccatgcc 9120 

tggtagtgtg gggtctcccc atgcgagagt 9180 

aggctcagtc gaaagactgg gcctttcgtt 9240 

tgagtaggac aaatccgccg ggagcggatt 9300 

ggcgggcagg acgcccgcca taaactgcca 9360 

cggatggcct ttttgcgttt ctacaaactc 9420 

tatgtatccg ctgatcccca tcctatcgat 9480 

9484 



<210> 50 
<211> 9566 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Vector pEPEF18 
<400> 50 

acccagtcag ctccttccgg tgggcgcggg 
tcttctttat catgcaactc gtaggacagg 
aggaccgctt tcgctggagc gcgacgatga 
tgcacgccct cgctcaagcc ttcgtcactg 
aggccattat cgccggcatg gcggccgacg 
cgcgaggctg gatggccttc cccattatga 
ccgcgttgca ggccatgctg tccaggcagg 
gatcgctcgc ggctcttacc agcctaactt 
tttatgccgc ctcggcgagc acatggaacg 
accttgtctg cctccccgcg ttgcgtcgcg 
tggaagccgg cggcacctcg ctaacggatt 
cttgcggaga actgtgaatg cgcaaaccaa 
catctccagc agccgcacgc ggcgcatctc 
catgatcgtg ctcctgtcgt tgaggacccg 
gcagaatgaa tcaccgatac gcgagcgaac 
gacctgagca acaacatgaa tggtcttcgg 
gaagtcccct acgtgctgct gaagttgccc 
cacgatacta tgactgagag tcaacgccat 
acagtccgca ccgctgtccg gtagctcctt 
cgcacttatg actgtcttct ttatcatgca 
cagtcccccg gccacggggc ctgccaccat 
tatgttccgg atctgcatcg caggatgctg 
attaacgaag cgctaaccgt ttttatcagg 
tcaattatta cctccacggg gagagcctga 
acggtcacac tgcttccggt agtcaataaa 
atttaacgac cctgccctga accgacgacc 
tcatccgctt attatcactt attcaggcgt 
gccttaaaaa aattacgccc cgccctgcca 
cattctgccg acatggaagc catcacagac 
cagcaccttg tcgccttgcg tataatattt 
gtccatattg gccacgttta aatcaaaact 
gaaaaacata ttctcaataa accctttagg 
cacatcttgc gaatatatgt gtagaaactg 
cgatgaaaac gtttcagttt gctcatggaa 
tatcaccagc tcaccgtctt tcattgccat 
tttaaatttt tgaaaaaaat aaaaaaaggc 
ttaatctttt gttcttttgc gtgaaaaaaa 
aacagtcaca aaaatcgatg tatacagtca 



gcatgactat cgtcgccgca cttatgactg 60 
tgccggcagc gctctgggtc attttcggcg 120 
tcggcctgtc gcttgcggta ttcggaatct 180 
gtcccgccac caaacgtttc ggcgagaagc 240 
cgctgggcta cgtcttgctg gcgttcgcga 300 
ttcttctcgc ttccggcggc atcgggatgc 360 
tagatgacga ccatcaggga cagcttcaag 420 
cgatcactgg accgctgatc gtcacggcga 480 
ggttggcatg gattgtaggc gccgccctat 540 
gtgcatggag ccgggccacc tcgacctgaa 600 
caccactcca agaattggag ccaatcaatt 660 
cccttggcag aacatatcca tcgcgtccgc 720 
gggcagcgtt gggtcctggc cacgggtgcg 780 
gctaggctgg cggggttgcc ttactggtta 840 
gtgaagcgac tgctgctgca aaacgtctgc 900 
tttccgtgtt tcgtaaagtc tggaaacgcg 960 
gcaacagaga gtggaaccaa ccggtgatac 1020 
gagcggcctc atttcttatt ctgagttaca 1080 
ccggtgggcg cggggcatga ctatcgtcgc 1140 
actcgtagga caggtgccgg cagcgcccaa 1200 
acccacgccg aaacaagcgc cctgcaccat 1260 
ctggctaccc tgtggaacac ctacatctgt 1320 
ctctgggagg cagaataaat gatcatatcg 1380 
gcaaactggc ctcaggcatt tgagaagcac 14 40 
ccggtaaacc agcaatagac ataagcggct 1500 
gggtcgaatt tgctttcgaa tttctgccat 1560 
agcaccaggc gtttaagggc accaataact 1620 
ctcatcgcag tactgttgta attcattaag 1680 
ggcatgatga acctgaatcg ccagcggcat 174 0 
gcccatggtg aaaacggggg cgaagaagtt 1800 
ggtgaaactc acccagggat tggctgagac 18 60 
gaaataggcc aggttttcac cgtaacacgc 1920 
ccggaaatcg tcgtggtatt cactccagag 1980 
aacggtgtaa caagggtgaa cactatccca 2040 
acgagaggtt tgatgacttt gacctttggt 2100 
gaagcctatt atatatttat catatatatt 2160 
aggcagtgtt ttcgctagtt atagaaatta 2220 
caaaaatcga tttttgtgac tctatgcatc 2280 
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gatgtatacg agtcacaaaa atcgatgtac acagtacgac ttttgtattt gtgtactgta 2340 
tatagtataa taaaagcata gagaaaactc actatgaaat gactttctct atgctactac 2400 
taaaacacgc aaaggagcgt atttatacta tgattatacc agaaaaacaa aacaaacaaa 2460 
agcaggtgct gaccttgaat gaattagaaa. aacgcaaagt agtggagcat aacgctttaa 2520 
ttcaaagtgt ggctaaaatg caaaaaacag ctttaaaaat gtttgagtta gccgtgtctt 2580 
gtattgatac cgaagaaccc ccaaaaaata atactgttta tcttttaaaa tctgaacttt 264 0 
ttaaattttt tgaagtatcg tctagcagta agcatagtca attcaaagaa gcagttaatt 2700 
atatgcaaaa gcaagccttt ttcaatatta aagctgataa gaaactaggt attgagtatg 2760 
aaagcattgt gcctattcct tatgtaaaat ggaacgatta taacgatgaa gtaacaatac 2820 
gttttgacca agccattatg ccctatctga ttgatttaaa ggcagaattt actcaatata 2880 
aaatttccga actacaaaaa ctcaatagca agtattcgat tattttgtat cgttggttat 2940 
ccatgaatta taaccaatac gaacattaca gcgtgaaagg gggacggaga gttgaacaag 3000 
tggaatccta ccgcaacccc tcaataaaag tgaaagaatt gcgagaaata actgatacaa 3060 
taaatgaaca tcaacatttt ccccattttg aaactagagt attaaaaaaa gcaattgaag 3120 
aaatcaacgc tcacacctct tttaatgtaa cctatgagaa aaagaaagca gggcggagca 3180 
ttgactctat tgtctttcat attgagaaga aacgcatggc agacgataac agttacaagt 3240 
tgggagataa agactatcaa gacgataaaa agcaaaaatc aagaaatgaa gctgacttac 3300 
taaaacaggc aatggaaagt aaatatacaa gacttttatc tgaaaatttc ttaattggca 3360 
tgaatgatat tatggataca gccacaatgg taggcttaca gaagaatgtt tatccgcttt 3420 
atgacgaact gaaagaatta agagggctaa atggtgtcaa agaccacttg tcttatgttg 3480 
ccagcaaaaa agaagcctat tctaaacgta atgtagcgaa atatcttaaa aaagccattg 3540 
aacaatatct gccaacggtt aaaaggcagg acttaaacca tgagtgaaga cttaaaaacg 3600 
ataaaagagt tggcggacga gctaggacta gctaaacaga ccgttcaata ccatttaaag 3660 
tttttgccga ccgaaaatcg acaaaaaaac aatagtggtt cgatcgtcct aaaccttgaa 3720 
gaacaaaagt ttatcaaaaa aagactaaca aaaataaaaa aagttcaacc gaccgaaaat 3780 
cgacaagaaa aaatgaagaa ttaagcgtta atagtctttt gttaggcgga tgagcattca 3840 
tcaggcgggc aagaatgtga ataaaggccg gataaaactt gtgcttattt ttctttacgg 3900 
tctttaaaaa ggccgtaata tccagctgaa cggtctggtt ataggtacat tgagcaactg 3960 
actgaaatgc ctcaaaatgt tctttacgat gccattggga tatatcaacg gtggtatatc 4020 
cagtgatttt tttctccatt ttagcttcct tagctcctga aaatctcgat aactcaaaaa 4080 
atacgcccgg tagtgatctt atttcattat ggtgaaagtt ggaacctctt acgtgccgat 4140 
caacgtctca ttttcgccaa aagttggccc agggcttccc ggtatcaaca gggacaccag 4200 
gatttattta ttctgcgaag tgatcttccg tcacaggtat ttattcggcg caaagtgcgt 4260 
cgggtgatgc tgccaactta ctgatttagt gtatgatggt gtttttgagg tgctccagtg 4320 
gcttctgttt ctatcagctg tccctcctgt tcagctactg acggggtggt gcgtaacggc 4380 
aaaagcaccg ccggacatca gcgctagcgg agtgtatact ggcttactat gttggcactg 4440 
atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa aaggctgcac cggtgcgtca 4500 
gcagaatatg tgatacagga tatattccgc ttcctcgctc actgactcgc tacgctcggt 4560 
cgttcgactg cggcgagcgg aaatggctta cgaacggggc ggagatttcc tggaagatgc 4620 
caggaagata cttaacaggg aagtgagagg gccgcggcaa agccgttttt ccataggctc 4680 
cgcccccctg acaagcatca cgaaatctga cgctcaaatc agtggtggcg aaacccgaca 4740 
ggactataaa gataccaggc gtttccccct ggcggctccc tcgtgcgctc tcctgttcct 4800 
gcctttcggt ttaccggtgt cattccgctg ttatggccgc gtttgtctca ttccacgcct 4860 
gacactcagt tccgggtagg cagttcgctc caagctggac tgtatgcacg aaccccccgt 4920 
tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggaaagaca 4980 
tgcaaaagca ccactggcag cagccactgg taattgattt agaggagtta gtcttgaagt 504 0 
catgcgccgg ttaaggctaa actgaaagga caagttttgg tgactgcgct cctccaagcc 5100 
agttacctcg gttcaaagag ttggtagctc agagaacctt cgaaaaaccg ccctgcaagg 5160 
cggttttttc gttttcagag caagagatta cgcgcagacc aaaacgatct caagaagatc 5220 
atcttattaa tcagataaaa tatttctaga tttcagtgca atttatctct tcaaatgtag 5280 
cacctgaagt cagccccata cgatataagt tgtaattctc atgtttgaca gcttatcatc 5340 
gatacaaatt ccccgtaggc gctagggacc tctttagctt cttggaagct gtcagtagta 5400 
tatctaataa tttatctcca ttccctttag taacgtgtaa ctttccaaat ttaaaaaagc 54 60 
gactcataga attatttcct cccgttaaat aatagataac tattaaaaat agacaatact 5520 
tgctcataag taatggtact taaattgttt actttggcgt gtttcattgc ttgatgaaac 5580 
tgatttttag taaacagttg acgatattct cgattgaccc attttgaaac aaagtacgta 5640 
tatagcttcc aatatttatc tggaacatct gtggtatggc gggtaagttt tattaagaca 5700 
ctgtttactt ttggtttagg atgaaagcat tccgctggca gcttaagcaa ttgctgaatc 5760 
gagacttgag tgtgcaagag caaccctagt gttcggtgaa tatccaaggt acgcttgtag 5820 
aatccttctt caacaatcag atagatgtca gacgcacggc tttcaaaaac cactttttta 5880 
ataatttgtg tgcttaaatg gtaaggaata ttcccaacaa ttttatacct ctgtttgtta 5940 
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gggaattgaa actgtagaat atcttggtga 
ttttctgacg ataagttgaa tagatgactg 
attttagcca gtttcgtcgt taaatgccct 
gtttctttta aattcaattg ttttattatt 
ttttgagaat attttatatt tttgttcatg 
acaaatacag aagttaaacg atttgtttgt 
gattagtcac tggtaggaat taatctaacg 
agtctgtttc aaaacagtag atgttttatc 
gaatccctcc ttcttaatta caaattttta 
caaaatttta agataatgca ctatcaacac 
cataacttct tttacgtttc cgccattctt 
gtcagcacga acacgaaccg tcttatctcc 
atcatttcgt ttttcttttt gtgcgcctaa 
tcttcttatt cttattttat catcaacaat 
tttgtgataa gtgatttgtg attaatatat 
attttgagaa agatataaaa tcaatatatc 
agggcttttt ctctttgttt ttgatttcct 
tcttttacat tcgttttggt tagttttttc 
cattcaggat aatttgaaca accataaaac 
ttaggacatt ttcctacaat acttttttct 
gaaaaattta gtttttctat atcgttaggt 
ttgatattcg taataaagtt ctzcttgattg 
gtttcccatt tagccgtcat ttcagcactc 
cacaataatt ttcctttttc agttacaaca 
tcataaaacc actccttttt aacaaacttt 
tataacttaa atttatattt tttactttat 
tcatatcgtt tgccaacatc tagtactcaa 
ccgaacttcc cctgtccaaa ttagataaga 
taaatgaaaa tgagaagcgt gtattaaaac 
cacaaatatc gaagaattta gaacttaata 
taaaacacaa gagtttagtt aatgaagtag 
gaaagcctat tttactcgaa attaaccaaa 
catatgattc cgttgaatta atgtacaact 
cctacgaatt aaatgataaa aatgtaagca 
acgtctcaga aaaatatgat acgttatatg 
gtatcgttga cgatgagcaa aacataatca 
catttaccga tgaattaaag tcattcacaa 
atttatcagc gctatatgaa aaaagtttat 
ctttaagtat tcacaagggt ataggcgctg 
gctcaaatgg agaggctgga gagataggta 
atgacaacaa atattataaa atcgaagata 
taaataatag gttgggcgtc acattgacgt 
gaaattcaat tgttgctcat gaaattaaac 
ataatttgaa tacacaattt aacccagacg 
aattaccaaa tattttaaat gaaattaaag 
caattcaatt acatttaact actaatgtaa 
caataatgca aaaaacatta aatataaata 
agtctgagtt ataaaataga tatctcggac 
tttttatgtt ttagaataag tggtctagtt 
aattaaatag ttgtcgtttg agacgactga 
tgaaaagata acacagttat tcttgacaaa 
tcgagagtta gtttgtttat taaattaacc 
ctgcagccaa gcttgggctt ttcagcctga 
tgataaaaca gaatttgcct ggcggcagta 
actcagaagt gaaacgccgt agcgccgatg 
ggaactgcca ggcatcaaat aaaacgaaag 
atctgttgtt tgtcggtgaa cgctctcctg 
aacgttgcga agcaacggcc cggagggtgg 
catcaaatta agcagaaggc catcctgacg 
tttgtttatt tttctaaata cattcaaata 



attaaagtga cacgagtatt cagttttaat 6000 
tctaattcaa tagacgttac ctgtttactt 6060 
ttacctgttc caatttcgta aacggtatcg 6120 
tggttgagta ctttttcact cgttaaaaag 6180 
taatctctcc tgaagtgatt acatctataa 6240 
aattttagtt atctgtttaa aaagtcataa 6300 
tatttattta tctgcgtaat cactgttttt 6360 
tacattacgc atttggaata ccaacatgac 6420 
gcatctaatt taacttcaat tcctattata 6480 
actcttaagt ttgcttctaa gtcttatttc 6540 
tgctgtttcg atttttatga tatggtgcaa 6600 
cattatatct ttttttgcac tgattggtgt 6660 
atttcccaca atcactcact tctttctatt 6720 
cacaaatcac ttgtgatttg tgataagtga 6780 
aaaagccctc tttaaagggc ttttatgttt 6840 
ccttttcccc aatttttaca acggcattgt 6900 
tttaccaggg tttcttttcc ctccagtaat 6960 
tttctaaaat gttcagctaa agtaaactta 7020 
gattttttta atacaatatt gttgccacac 7080 
gcttcttttt ctttctgttc ctggtaatca 7140 
acagcttcca gtaaatgaac aatgaatttt 7200 
ccttctcttt taccgatttt ttttaaatac 7260 
gttaaaaggt gctgactttc aactgcctgg 7320 
gcctgcaggt cgacgataac cacataacag 7380 
atcacaagaa atattttggc attctacgac 7440 
aatatataat tgattataga ataatgttgc 7500 
attacactat gttacacttg gtaatattaa 7560 
ggtaataata aatggaaaat aattttatag 7620 
aaattttcaa taacagcaat atttcacgaa 7680 
aagctactat ttctaacatt ctgaacaact 7740 
gagaaggtaa tagtactaaa agtggtggac 7800 
aatatggcta ctatatttct atggatttaa 7860 
actttgatgc tactatatta aagcaagatt 7920 
gtatattaca aattttaaaa tctaatataa 7980 
ggttacttgg tatatctata tccatacacg 8040 
atcttccttt tcataaaaat gagaaacgca 8100 
atgttcctgt cgttatagaa aatgaaqcaa 8160 
atattaattc aaacataaat aatttgatta 8220 
gcatcataat aaataaaaaa ctttatcgtg 8280 
agacattggt tttggaatct ataaataaca 8340 
tatgctccca agacgcttta atacagaaaa 8400 
ttacagaact aatccaatat tacaacgaag 8460 
aatttattaa taaaatgaca gttctgattc 8520 
ctatttatat taactgtcct ttaattaatg 8580 
agcaattctc ctgtttttct caaggcagtc 8640 
aacaagctac tttattgggt ggcactttag 8700 
acattcaaat gaatattaaa taattacagc 8760 
cgaaacagcc gaataaggct gtttttcttt 8820 
tattcttgac aaaaaataat attttgatat 8880 
ctttcttatt attcatctaa aatattattt 8940 
aaaatataaa agtgtataat agaaaagtac 9000 
aactggacga attcccgggg atccgtcgac 9060 
tacagattaa atcagaacgc agaagcggtc 9120 
gcgcggtggt cccacctgac cccatgccga 9180 
gtagtgtggg gtctccccat gcgagagtag 9240 
gctcagtcga aagactgggc ctttcgtttt 9300 
agtaggacaa atccgccggg agcggatttg 9360 
cgggcaggac gcccgccata aactgccagg 9420 
gatggccttt ttgcgtttct acaaactctt 9480 
tgtatccgct gatccccatc ctatcgatcc 9540 
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ccacccgatg cccttgagag ccttca 9566 



<210> 51 
<211> 9412 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pEPEF20 
<400> 51 

acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 
ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 
cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 
tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 240 
ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 
gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 
tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 
acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 
ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 
catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 
ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 
tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 
gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca tctcgggcag 780 
cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga cccggctagg 840 
ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc gaacgtgaag 900 
cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 960 
tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt gcccgcaaca 1020 
gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg ccatgagcgg 1080 
cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct ccttccggtg 1140 
ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1200 
aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac 1260 
gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat gctgctggct 1320 
accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat caggctctgg 1380 
gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc ctgagcaaac 1440 
tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa taaaccggta 1500 
aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac gaccgggtcg 1560 
aatttgcttt cgaatttctg ccattcatcc gcttattatc acttattcag gcgtagcacc 1620 
aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct gccactcatc 1680 
gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 174 0 
atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 1800 
ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 1860 
actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 1920 
ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 1980 
atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 2040 
gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacgaga 2100 
ggtttgatga ctttgacctt tggttttaaa tttttgaaaa aaataaaaaa aggcgaagcc 2160 
tattatatat ttatcatata tattttaatc ttttgttctt ttgcgtgaaa aaaaaggcag 2220 
tgttttcgct agttatagaa attaaacagt cacaaaaatc gatgtataca gtcacaaaaa 2280 
tcgatttttg tgactctatg catcgatgta tacgagtcac aaaaatcgat gtacacagta 2340 
cgacttttgt atttgtgtac tgtatatagt ataataaaag catagagaaa actcactatg 2400 
aaatgacttt ctctatgcta ctactaaaac acgcaaagga gcgtatttat actatgatta 24 60 
taccagaaaa acaaaacaaa caaaagcagg tgctgacctt gaatgaatta gaaaaacgca 2520 
aagtagtgga gcataacgct ttaattcaaa gtgtggctaa aatgcaaaaa acagctttaa 2580 
aaatgtttga gttagccgtg tcttgtattg ataccgaaga acccccaaaa aataatactg 2640 
tttatctttt aaaatctgaa ctttttaaat tttttgaagt atcgtctagc agtaagcata 2700 
gtcaattcaa agaagcagtt aattatatgc aaaagcaagc ctttttcaat attaaagctg 2760 
ataagaaact aggtattgag tatgaaagca ttgtgcctat tccttatgta aaatggaacg 2820 
attataacga tgaagtaaca atacgttttg accaagccat tatgccctat ctgattgatt 2880 
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taaaggcaga atttactcaa tataaaattt ccgaactaca aaaactcaat agcaagtatt 2940 
cgattatttt gtatcgttgg ttatccatga attataacca atacgaacat tacagcgtga 3000. 
aagggggacg gagagttgaa caagtggaat cctaccgcaa cccctcaata aaagtgaaag 3060 
aattgcgaga aataactgat acaataaatg aacatcaaca ttttccccat tttgaaacta 3120 
gagtattaaa aaaagcaatt gaagaaatca acgctcacac ctcttttaat gtaacctatg 3180 
agaaaaagaa agcagggcgg agcattgact ctattgtctt tcatattgag aagaaacgca 3240 
tggcagacga taacagttac aagttgggag ataaagacta tcaagacgat aaaaagcaaa 3300 
aatcaagaaa tgaagctgac ttactaaaac aggcaatgga aagtaaatat acaagacttt 3360 
tatctgaaaa tttcttaatt ggcatgaatg atattatgga tacagccaca atggtaggct 3420 
tacagaagaa tgtttatccg ctttatgacg aactgaaaga attaagaggg ctaaatggtg 3480 
tcaaagacca cttgtcttat gttgccagca aaaaagaagc ctattctaaa cgtaatgtag 3540 
cgaaatatct taaaaaagcc attgaacaat atctgccaac ggttaaaagg caggacttaa 3600 
accatgagtg aagacttaaa aacgataaaa gagttggcgg acgagctagg actagctaaa 3660 
cagaccgttc aataccattt aaagtttttg ccgaccgaaa atcgacaaaa aaacaatagt 3720 
ggttcgatcg tcctaaacct tgaagaacaa aagtttatca aaaaaagact aacaaaaata 3780 
aaaaaagttc aaccgaccga aaatcgacaa gaaaaaatga agaattaagc gttaatagtc 3840 
ttttgttagg cggatgagca ttcatcaggc gggcaagaat gtgaataaag gccggataaa 3900 
acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc tgaacggtct 3960 
ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta cgatgccatt 4020 
gggatatatc aacggtggta tatccagtga tttttttctc cattttagct tccttagctc 4080 
ctgaaaatct cgataactca aaaaatacgc ccggtagtga tcttatttca ttatggtgaa 4140 
agttggaacc tcttacgtgc cgatcaacgt ctcattttcg ccaaaagttg gcccagggct 4200 
tcccggtatc aacagggaca ccaggattta tttattctgc gaagtgatct tccgtcacag 4260 
gtatttattc ggcgcaaagt gcgtcgggtg atgctgccaa cttactgatt tagtgtatga 4320 
tggtgttttt gaggtgctcc agtggcttct gtttctatca gctgtccctc ctgttcagct 4380 
actgacgggg tggtgcgtaa cggcaaaagc accgccggac atcagcgcta gcggagtgta 4440 
tactggctta ctatgttggc actgatgagg gtgtcagtga agtgcttcat gtggcaggag 4500 
aaaaaaggct gcaccggtgc gtcagcagaa tatgtgatac aggatatatt ccgcttcctc 4560 
gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg cttacgaacg 4620 
gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga gagggccgcg 4 680 
gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat ctgacgctca 4740 
aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggcggc 4800 
tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc gctgttatgg 4860 
ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc gctccaagct 4920 
ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg gtaactatcg 4980 
tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca ctggtaattg 5040 
atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa aggacaagtt 5100 
ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta gctcagagaa 5160 
ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag attacgcgca 5220 
gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc tagatttcag 5280 
tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat aagttgtaat 5340 
tctcatgttt gacagcttat catcgataca aattccccgt aggcgctagg gacctcttta 5400 
gcttcttgga agctgtcagt agtatatcta ataatttatc tccattccct ttagtaacgt 5460 
gtaactttcc aaatttaaaa aagcgactca tagaattatt tcctcccgtt aaataataga 5520 
taactattaa aaatagacaa tacttgctca taagtaatgg tacttaaatt gtttactttg 5580 
gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca gttgacgata ttctcgattg 5640 
acccattttg aaacaaagta cgtatatagc ttccaatatt tatctggaac atctgtggta 5700 
tggcgggtaa gttttattaa gacactgttt acttttggtt taggatgaaa gcattccgct 5760 
ggcagcttaa gcaattgctg aatcgagact tgagtgtgca agagcaaccc tagtgttcgg 5820 
tgaatatcca aggtacgctt gtagaatcct tcttcaacaa tcagatagat gtcagacgca 5880 
cggctttcaa aaaccacttt tttaataatt tgtgtgctta aatggtaagg aatattccca 5940 
acaattttat acctctgttt gttagggaat tgaaactgta gaatatcttg gtgaattaaa 6000 
gtgacacgag tattcagttt taatttttct gacgataagt tgaatagatg actgtctaat 6060 
tcaatagacg ttacctgttt acttatttta gccagtttcg tcgttaaatg ccctttacct 6120 
gttccaattt cgtaaacggt atcggtttct tttaaattca attgttttat tatttggttg 6180 
agtacttttt cactcgttaa aaagttttga gaatatttta tatttttgtt catgtaatct 6240 
ctcctgaagt gattacatct ataaacaaat acagaagtta aacgatttgt ttgtaatttt 6300 
agttatctgt ttaaaaagtc ataagattag tcactggtag gaattaatct aacgtattta 6360 
tttatctgcg taatcactgt ttttagtctg tttcaaaaca gtagatgttt tatctacatt 6420 
acgcatttgg aataccaaca tgacgaatcc ctccttctta attacaaatt tttagcatct 6480 
aatttaactt caattcctat tatacaaaat tttaagataa tgcactatca acacactctt 6540 
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aagtttgctt ctaagtctta 
ttcgattttt atgatatggt 
atcttttttt gcactgattg 
cacaatcact cacttctttc 
tcacttgtga tttgtgataa 
cctctttaaa gggcttttat 
ccccaatttt tacaacggca 
agggtttctt ttccctccag 
aaatgttcag ctaaagtaaa 
tttaatacaa tattgttgcc 
ttttctttct gttcctggta 
tccagtaaat gaacaatgaa 
cttttaccga ttttttttaa 
aggtgctgac tttcaactgc 
aggtcgacga taaccacata 
agaaatattt tggcattcta 
taattgatta tagaataatg 
ctatgttaca cttggtaata 
aataaatgga aaataatttt 
tcaataacag caatatttca 
ctatttctaa cattctgaac 
gtaatagtac taaaagtggt 
gctactatat ttctatggat 
atgctactat attaaagcaa 
tacaaatttt aaaatctaat 
ttggtatatc tatatccata 
cttttcataa aaatgagaaa 
ctgtcgttat agaaaatgaa 
attcaaacat aaataatttg 
taataaataa aaaactttat 
tggttttgga atctataaat 
cccaagacgc tttaatacag 
aactaatcca atattacaac 
ttaataaaat gacagttctg 
atattaactg tcctttaatt 
tctcctgttt ttctcaaggc 
ctactttatt gggtggcact 
aaatgaatat taaataatta 
gtgaactttt ttgttgacaa 
gtttgtttat taaattaacc 
gcttgggctt ttcagcctga 
gaatttgcct ggcggcagta 
gaaacgccgt agcgccgatg 
ggcatcaaat aaaacgaaag 
tgtcggtgaa cgctctcctg 
agcaacggcc cggagggtgg 
agcagaaggc catcctgacg 
tttctaaata cattcaaata 



tttccataac ttcttttacg 
gcaagtcagc acgaacacga 
gtgtatcatt tcgtttttct 
tatttcttct tattcttatt 
gtgatttgtg ataagtgatt 
gtttattttg agaaagatat 
ttgtagggct ttttctcttt 
taattctttt acattcgttt 
cttacattca ggataatttg 
acacttagga cattttccta 
atcagaaaaa tttagttttt 
ttttttgata ttcgtaataa 
atacgtttcc catttagccg 
ctggcacaat aattttcctt 
acagtcataa aaccactcct 
cgactataac ttaaatttat 
ttgctcatat cgtttgccaa 
ttaaccgaac ttcccctgtc 
atagtaaatg aaaatgagaa 
cgaacacaaa tatcgaagaa 
aacttaaaac acaagagttt 
ggacgaaagc ctattttact 
ttaacatatg attccgttga 
gattcctacg aattaaatga 
ataaacgtct cagaaaaata 
cacggtatcg ttgacgatga 
cgcacattta ccgatgaatt 
gcaaatttat cagcgctata 
attactttaa gtattcacaa 
cgtggctcaa atggagaggc 
aacaatgaca acaaatatta 
aaaataaata ataggttggg 
gaaggaaatt caattgttgc 
attcataatt tgaatacaca 
aatgaattac caaatatttt 
agtccaattc aattacattt 
ttagcaataa tgcaaaaaac 
cagcagtctg agttataaaa 
agataaaaac acatgatata 
aactggacga attcccgggg 
tacagattaa atcagaacgc 
gcgcggtggt cccacctgac 
gtagtgtggg gtctccccat 
gctcagtcga aagactgggc 
agtaggacaa atccgccggg 
cgggcaggac gcccgccata 
gatggccttt ttgcgtttct 
tgtatccgct gatccccatc 



tttccgccat tctttgctgt 6600 
accgtcttat ctcccattat 6660 
ttttgtgcgc ctaaatttcc 6720 
ttatcatcaa caatcacaaa 6780 
tgtgattaat atataaaagc 6840 
aaaatcaata tatccctttt 6900 
gtttttgatt tccttttacc 6960 
tggttagttt tttctttcta 7020 
aacaaccata aaacgatttt 7080 
caatactttt ttctgcttct 7140 
ctatatcgtt aggtacagct 7200 
agttctcttg attgccttct 7260 
tcatttcagc actcgttaaa 7320 
tttcagttac aacagcctgc 7380 
ttttaacaaa ctttatcaca 7440 
attttttact ttataatata 7500 
catctagtac tcaaattaca 7560 
caaattagat aagaggtaat 7620 
gcgtgtatta aaacaaattt 7680 
tttagaactt aataaagcta 7740 
agttaatgaa gtaggagaag 7800 
cgaaattaac caaaaatatg 7860 
attaatgtac aactactttg 7920 
taaaaatgta agcagtatat 7980 
tgatacgtta tatgggttac 8040 
gcaaaacata atcaatcttc 8100 
aaagtcattc acaaatgttc 8160 
tgaaaaaagt ttatatatta 8220 
gggtataggc gctggcatca 8280 
tggagagata ggtaagacat 8340 
taaaatcgaa gatatatgct 8400 
cgtcacattg acgtttacag 8460 
tcatgaaatt aaacaattta 8520 
atttaaccca gacgctattt 8580 
aaatgaaatt aaagagcaat 8640 
aactactaat gtaaaacaag 8700 
attaaatata aataacattc 8760 
tagatatctc ggaccgtttc 8820 
cttaaatcac tcgagagtta 8880 
atccgtcgac ctgcagccaa 8940 
agaagcggtc tgataaaaca 9000 
cccatgccga actcagaagt 9060 
gcgagagtag ggaactgcca 9120 
ctttcgtttt atctgttgtt 9180 
agcggatttg aacgttgcga 9240 
aactgccagg catcaaatta 9300 
acaaactctt tttgtttatt 9360 
ctatcgatcc cc 9412 



<210> 52 
<211> 9412 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pEPEF22 
<4O0> 52 

acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 
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ctatcgtcgc cgcacttatg actgtcttct 
cagcgctctg ggtcattttc ggcgaggacc 
tgtcgcttgc ggtattcgga atcttgcacg 
ccaccaaacg tttcggcgag aagcaggcca 
gctacgtctt gctggcgttc gcgacgcgag 
tcgcttccgg cggcatcggg atgcccgcgt 
acgaccatca gggacagctt caaggatcgc 
ctggaccgct gatcgtcacg gcgatttatg 
catggattgt aggcgccgcc ctataccttg 
ggagccgggc cacctcgacc tgaatggaag 
tccaagaatt ggagccaatc aattcttgcg 
gcagaacata tccatcgcgt ccgccatctc 
cgttgggtcc tggccacggg tgcgcatgat 
ctggcggggt tgccttactg gttagcagaa 
cgactgctgc tgcaaaacgt ctgcgacctg 
tgtttcgtaa agtctggaaa cgcggaagtc 
gagagtggaa ccaaccggtg ataccacgat 
cctcatttct tattctgagt tacaacagtc 
ggcgcggggc atgactatcg tcgccgcact 
aggacaggtg ccggcagcgc ccaacagtcc 
gccgaaacaa gcgccctgca ccattatgtt 
accctgtgga acacctacat ctgtattaac 
gaggcagaat aaatgatcat atcgtcaatt 
tggcctcagg catttgagaa gcacacggtc 
aaccagcaat agacataagc ggctatttaa 
aatttgcttt cgaatttctg ccattcatcc 
aggcgtttaa gggcaccaat aactgcctta 
gcagtactgt tgtaattcat taagcattct 
atgaacctga atcgccagcg gcatcagcac 
ggtgaaaacg ggggcgaaga agttgtccat 
actcacccag ggattggctg agacgaaaaa 
ggccaggttt tcaccgtaac acgccacatc 
atcgtcgtgg tattcactcc agagcgatga 
gtaacaaggg tgaacactat cccatatcac 
ggtttgatga ctttgacctt tggttttaaa 
tattatatat ttatcatata tattttaatc 
tgttttcgct agttatagaa attaaacagt 
tcgatttttg tgactctatg catcgatgta 
cgacttttgt atttgtgtac tgtatatagt 
aaatgacttt ctctatgcta ctactaaaac 
taccagaaaa acaaaacaaa caaaagcagg 
aagtagtgga gcataacgct ttaattcaaa 
aaatgtttga gttagccgtg tcttgtattg 
tttatctttt aaaatctgaa ctttttaaat 
gtcaattcaa agaagcagtt aattatatgc 
ataagaaact aggtattgag tatgaaagca 
attataacga tgaagtaaca atacgttttg 
taaaggcaga atttactcaa tataaaattt 
cgattatttt gtatcgttgg ttatccatga 
aagggggacg gagagttgaa caagtggaat 
aattgcgaga aataactgat acaataaatg 
gagtattaaa aaaagcaatt gaagaaatca 
agaaaaagaa agcagggcgg agcattgact 
tggcagacga taacagttac aagttgggag 
aatcaagaaa tgaagctgac ttactaaaac 
tatctgaaaa tttcttaatt ggcatgaatg 
tacagaagaa tgtttatccg ctttatgacg 
tcaaagacca cttgtcttat gttgccagca 
cgaaatatct taaaaaagcc attgaacaat 
accatgagtg aagacttaaa aacgataaaa 
cagaccgttc aataccattt aaagtttttg 



ttatcatgca actcgtagga caggtgccgg 120 
gctttcgctg gagcgcgacg atgatcggcc 180 
ccctcgctca agccttcgtc actggtcccg 240 
ttatcgccgg catggcggcc gacgcgctgg 300 
gctggatggc cttccccatt atgattcttc 360 
tgcaggccat gctgtccagg caggtagatg 420 
tcgcggctct taccagccta acttcgatca 480 
ccgcctcggc gagcacatgg aacgggttgg 540 
tctgcctccc cgcgttgcgt cgcggtgcat 600 
ccggcggcac ctcgctaacg gattcaccac 660 
gagaactgtg aatgcgcaaa ccaacccttg 720 
cagcagccgc acgcggcgca tctcgggcag 780 
cgtgctcctg tcgttgagga cccggctagg 840 
tgaatcaccg atacgcgagc gaacgtgaag 900 
agcaacaaca tgaatggtct tcggtttccg 960 
ccctacgtgc tgctgaagtt gcccgcaaca 1020 
actatgactg agagtcaacg ccatgagcgg 1080 
cgcaccgctg tccggtagct ccttccggtg 1140 
tatgactgtc ttctttatca tgcaactcgt 1200 
cccggccacg gggcctgcca ccatacccac 1260 
ccggatctgc atcgcaggat gctgctggct 1320 
gaagcgctaa ccgtttttat caggctctgg 1380 
attacctcca cggggagagc ctgagcaaac 1440 
acactgcttc cggtagtcaa taaaccggta 1500 
cgaccctgcc ctgaaccgac gaccgggtcg 1560 
gcttattatc acttattcag gcgtagcacc 1620 
aaaaaattac gccccgccct gccactcatc 1680 
gccgacatgg aagccatcac agacggcatg 1740 
cttgtcgcct tgcgtataat atttgcccat 1800 
attggccacg tttaaatcaa aactggtgaa 1860 
catattctca ataaaccctt tagggaaata 1920 
ttgcgaatat atgtgtagaa actgccggaa 1980 
aaacgtttca gtttgctcat ggaaaacggt 2040 
cagctcaccg tctttcattg ccatacgaga 2100 
tttttgaaaa aaataaaaaa aggcgaagcc 2160 
ttttgttctt ttgcgtgaaa aaaaaggcag 2220 
cacaaaaatc gatgtataca gtcacaaaaa 2280 
tacgagtcac aaaaatcgat gtacacagta 2340 
ataataaaag catagagaaa actcactatg 2400 
acgcaaagga gcgtatttat actatgatta 2460 
tgctgacctt gaatgaatta gaaaaacgca 2520 
gtgtggctaa aatgcaaaaa acagctttaa 2580 
ataccgaaga acccccaaaa aataatactg 2640 
tttttgaagt atcgtctagc agtaagcata 2700 
aaaagcaagc ctttttcaat attaaagctg 2760 
ttgtgcctat tccttatgta aaatggaacg 2820 
accaagccat tatgccctat ctgattgatt 2880 
ccgaactaca aaaactcaat agcaagtatt 2940 
attataacca atacgaacat tacagcgtga 3000 
cctaccgcaa cccctcaata aaagtgaaag 3060 
aacatcaaca ttttccccat tttgaaacta 3120 
acgctcacac ctcttttaat gtaacctatg 3180 
ctattgtctt tcatattgag aagaaacgca 3240 
ataaagacta tcaagacgat aaaaagcaaa 3300 
aggcaatgga aagtaaatat acaagacttt 3360 
atattatgga tacagccaca atggtaggct 3420 
aactgaaaga attaagaggg ctaaatggtg 3480 
aaaaagaagc ctattctaaa cgtaatgtag 3540 
atctgccaac ggttaaaagg caggacttaa 3600 
gagttggcgg acgagctagg actagctaaa 3660 
ccgaccgaaa atcgacaaaa aaacaatagt 3720 
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ggttcgatcg tcctaaacct tgaagaacaa 
aaaaaagttc aaccgaccga aaatcgacaa 
ttttgttagg cggatgagca ttcatcaggc 
acttgtgctt atttttcttt acggtcttta 
ggttataggt acattgagca actgactgaa 
gggatatatc aacggtggta tatccagtga 
ctgaaaatct cgataactca aaaaatacgc 
agttggaacc tcttacgtgc cgatcaacgt 
tcccggtatc aacagggaca ccaggattta 
gtatttattc ggcgcaaagt gcgtcgggtg 
tggtgttttt gaggtgctcc agtggcttct 
actgacgggg tggtgcgtaa cggcaaaagc 
tactggctta ctatgttggc actgatgagg 
aaaaaaggct gcaccggtgc gtcagcagaa 
gctcactgac tcgctacgct cggtcgttcg 
gggcggagat ttcctggaag atgccaggaa 
gcaaagccgt ttttccatag gctccgcccc 
aatcagtggt ggcgaaaccc gacaggacta 
tccctcgtgc gctctcctgt tcctgccttt 
ccgcgtttgt ctcattccac gcctgacact 
ggactgtatg cacgaacccc ccgttcagtc 
tcttgagtcc aacccggaaa gacatgcaaa 
atttagagga gttagtcttg aagtcatgcg 
ttggtgactg cgctcctcca agccagttac 
ccttcgaaaa accgccctgc aaggcggttt 
gaccaaaacg atctcaagaa gatcatctta 
tgcaatttat ctcttcaaat gtagcacctg 
tctcatgttt gacagcttat catcgataca 
gcttcttgga agctgtcagt agtatatcta 
gtaactttcc aaatttaaaa aagcgactca 
taactattaa aaatagacaa tacttgctca 
gcgtgtttca ttgcttgatg aaactgattt 
acccattttg aaacaaagta cgtatatagc 
tggcgggtaa gttttattaa gacactgttt 
ggcagcttaa gcaattgctg aatcgagact 
tgaatatcca aggtacgctt gtagaatcct 
cggctttcaa aaaccacttt tttaataatt 
acaattttat acctctgttt gttagggaat 
gtgacacgag tattcagttt taatttttct 
tcaatagacg ttacctgttt acttatttta 
gttccaattt cgtaaacggt atcggtttct 
agtacttttt cactcgttaa aaagttttga 
ctcctgaagt gattacatct ataaacaaat 
agttatctgt ttaaaaagtc ataagattag 
tttatctgcg taatcactgt ttttagtctg 
acgcatttgg aataccaaca tgacgaatcc 
aatttaactt caattcctat tatacaaaat 
aagtttgctt ctaagtctta tttccataac 
ttcgattttt atgatatggt gcaagtcagc 
atcttttttt gcactgattg gtgtatcatt 
cacaatcact cacttctttc tatttcttct 
tcacttgtga tttgtgataa gtgatttgtg 
cctctttaaa gggcttttat gtttattttg 
ccccaatttt tacaacggca ttgtagggct 
agggtttctt ttccctccag taattctttt 
aaatgttcag ctaaagtaaa cttacattca 
tttaatacaa tattgttgcc acacttagga 
ttttctttct gttcctggta atcagaaaaa 
tccagtaaat gaacaatgaa ttttttgata 
cttttaccga ttttttttaa atacgtttcc 
aggtgctgac tttcaactgc ctggcacaat 



aagtttatca aaaaaagact aacaaaaata 3780 
gaaaaaatga agaattaagc gttaatagtc 3840 
gggcaagaat gtgaataaag gccggataaa 3900 
aaaaggccgt aatatccagc tgaacggtct 3960 
atgcctcaaa atgttcttta cgatgccatt 4020 
tttttttctc cattttagct tccttagctc 4080 
ccggtagtga tcttatttca ttatggtgaa 4140 
ctcattttcg ccaaaagttg gcccagggct 4200 
tttattctgc gaagtgatct tccgtcacag 4260 
atgctgccaa cttactgatt tagtgtatga 4320 
gtttctatca gctgtccctc ctgttcagct 4380 
accgccggac atcagcgcta gcggagtgta 444 0 
gtgtcagtga agtgcttcat gtggcaggag 4500 
tatgtgatac aggatatatt ccgcttcctc 4560 
actgcggcga gcggaaatgg cttacgaacg 4620 
gatacttaac agggaagtga gagggccgcg 4 680 
cctgacaagc atcacgaaat ctgacgctca 474 0 
taaagatacc aggcgtttcc ccctggcggc 4800 
cggtttaccg gtgtcattcc gctgttatgg 4860 
cagttccggg taggcagttc gctccaagct 4 920 
cgaccgctgc gccttatccg gtaactatcg 4 980 
agcaccactg gcagcagcca ctggtaattg 5040 
ccggttaagg ctaaactgaa aggacaagtt 5100 
ctcggttcaa agagttggta gctcagagaa 5160 
tttcgttttc agagcaagag attacgcgca 5220 
ttaatcagat aaaatatttc tagatttcag 5280 
aagtcagccc catacgatat aagttgtaat 5340 
aattccccgt aggcgctagg gacctcttta 5400 
ataatttatc tccattccct ttagtaacgt 5460 
tagaattatt tcctcccgtzt aaataataga 5520 
taagtaatgg tacttaaatt gtttactttg 5580 
ttagtaaaca gttgacgata ttctcgattg 5640 
ttccaatatt tatctggaac atctgtggta 5700 
acttttggtt taggatgaaa gcattccgct 5760 
tgagtgtgca agagcaaccc tagtgttcgg 5820 
tcttcaacaa tcagatagat gtcagacgca 5880 
tgtgtgctta aatggtaagg aatattccca 5940 
tgaaactgta gaatatcttg gtgaattaaa 6000 
gacgataagt tgaatagatg actgtctaat 6060 
gccagtttcg tcgttaaatg ccctttacct 6120 
tttaaattca attgttttat tatttggttg 6180 
gaatatttta tatttttgtt catgtaatct 6240 
acagaagtta aacgatttgt ttgtaatttt 6300 
tcactggtag gaattaatct aacgtattta 6360 
tttcaaaaca gtagatgttt tatctacatt 6420 
ctccttctta attacaaatt tttagcatct 6480 
tttaagataa tgcactatca acacactctt 6540 
ttcttttacg tttccgccat tctttgctgt 6600 
acgaacacga accgtcttat ctcccattat 6660 
tcgtttttct ttttgtgcgc ctaaatttcc 6720 
tattcttatt ttatcatcaa caatcacaaa 6780 
ataagtgatt tgtgattaat atataaaagc 6840 
agaaagatat aaaatcaata tatccctttt 6900 
ttttctcttt gtttttgatt tccttttacc 6960 
acattcgttt tggttagttt tttctttcta 7020 
ggataatttg aacaaccata aaacgatttt 7080 
cattttccta caatactttt ttctgcttct 7140 
tttagttttt ctatatcgtt aggtacagct 7200 
ttcgtaataa agttctcttg attgccttct 7260 
catttagccg tcatttcagc actcgttaaa 7320 
aattttcctt tttcagttac aacagcctgc 7380 
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aggtcgacga taaccacata acagtcataa 
agaaatattt tggcattcta cgactataac 
taattgatta tagaataatg ttgctcatat 
ctatgttaca cttggtaata ttaaccgaac 
aataaatgga aaataatttt atagtaaatg 
tcaataacag caatatttca cgaacacaaa 
ctatttctaa cattctgaac aacttaaaac 
gtaatagtac taaaagtggt ggacgaaagc 
gctactatat ttctatggat ttaacatatg 
atgctactat attaaagcaa gattcctacg 
tacaaatttt aaaatctaat ataaacgtct 
ttggtatatc tatatccata cacggtatcg 
cttttcataa aaatgagaaa cgcacattta 
ctgtcgttat agaaaatgaa gcaaatttat 
attcaaacat aaataatttg attactttaa 
taataaataa aaaactttat cgtggctcaa 
tggttttgga atctataaat aacaatgaca 
cccaagacgc tttaatacag aaaataaata 
aactaatcca atattacaac gaaggaaatt 
ttaataaaat gacagttctg attcataatt 
atattaactg tcctttaatt aatgaattac 
tctcctgttt ttctcaaggc agtccaattc 
ctactttatt gggtggcact ttagcaataa 
aaatgaatat taaataatta cagcagtctg 
gtgaactttt ttgttgacaa agataaaaac 
gtttgtttat taaattaacc aactggacga 
gcttgggctt ttcagcctga tacagattaa 
gaatttgcct ggcggcagta gcgcggtggt 
gaaacgccgt agcgccgatg gtagtgtggg 
ggcatcaaat aaaacgaaag gctcagtcga 
tgtcggtgaa cgctctcctg agtaggacaa 
agcaacggcc cggagggtgg cgggcaggac 
agcagaaggc catcctgacg gatggccttt 
tttctaaata cattcaaata tgtatccgct 



aaccactcct ttttaacaaa ctttatcaca 74 40 
ttaaatttat attttttact ttataatata 7500 
cgtttgccaa catctagtac tcaaattaca 7560 
ttcccctgtc caaattagat aagaggtaat 7620 
aaaatgagaa gcgtgtatta aaacaaattt 7680 
tatcgaagaa tttagaactt aataaagcta 7740 
acaagagttt agttaatgaa gtaggagaag 7800 
ctattttact cgaaattaac caaaaatatg 7860 
attccgttga attaatgtac aactactttg 7920 
aattaaatga taaaaatgta agcagtatat 7980 
cagaaaaata tgatacgtta tatgggttac 8040 
ttgacgatga gcaaaacata atcaatcttc 8100 
ccgatgaatt aaagtcattc acaaatgttc 8160 
cagcgctata tgaaaaaagt ttatatatta 8220 
gtattcacaa gggtataggc gctggcatca 8280 
atggagaggc tggagagata ggtaagacat 8340 
acaaatatta taaaatcgaa gatatatgct 8400 
ataggttggg cgtcacattg acgtttacag 8460 
caattgttgc tcatgaaatt aaacaattta 8520 
tgaatacaca atttaaccca gacgctattt 8580 
caaatatttt aaatgaaatt aaagagcaat 8640 
aattacattt aactactaat gtaaaacaag 8700 
tgcaaaaaac attaaatata aataacattc 8760 
agttataaaa tagatatctc ggaccgtttc 8820 
acatgatata attaaatcac tcgagagtta 8880 
attcccgggg atccgtcgac ctgcagccaa 8940 
atcagaacgc agaagcggtc tgataaaaca 9000 
cccacctgac cccatgccga actcagaagt 9060 
gtctccccat gcgagagtag ggaactgcca 9120 
aagactgggc ctttcgtttt atctgttgtt 9180 
atccgccggg agcggatttg aacgttgcga 9240 
gcccgccata aactgccagg catcaaatta 9300 
ttgcgtttct acaaactctt tttgtttatt 9360 
gatccccatc ctatcgatcc cc 9412 



<210> 53 
<211> 6852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pXyl-T5-DD pl5a 



<400> 53 

tggcattcta cgactataac 
tagaataatg ttgctcatat 
cttggtaata ttaaccgaac 
aaataatttt atagtaaatg 
caatatttca cgaacacaaa 
cattctgaac aacttaaaac 
taaaagtggt ggacgaaagc 
ttctatggat ttaacatatg 
attaaagcaa gattcctacg 
aaaatctaat ataaacgtct 
tatatccata cacggtatcg 
aaatgagaaa cgcacattta 
agaaaatgaa gcaaatttat 
aaataatttg attactttaa 
aaaactttat cgtggctcaa 



ttaaatttat attttttact 
cgtttgccaa catctagtac 
ttcccctgtc caaattagat 
aaaatgagaa gcgtgtatta 
tatcgaagaa tttagaactt 
acaagagttt agttaatgaa 
ctattttact cgaaattaac 
attccgttga attaatgtac 
aattaaatga taaaaatgta 
cagaaaaata tgatacgtta 
ttgacgatga gcaaaacata 
ccgatgaatt aaagtcattc 
cagcgctata tgaaaaaagt 
gtattcacaa gggtataggc 
atggagaggc tggagagata 



ttataatata taattgatta 60 

tcaaattaca ctatgttaca 120 

aagaggtaat aataaatgga 1Q0 

aaacaaattt tcaataacag 240 

aataaagcta ctatttctaa 300 

gtaggagaag gtaatagtac 360 

caaaaatatg gctactatat 420 

aactactttg atgctactat 480 

agcagtatat tacaaatttt 540 

tatgggttac ttggtatatc 600 

atcaatcttc cttttcataa 660 

acaaatgttc ctgtcgttat 720 

ttatatatta attcaaacat 780 

gctggcatca taataaataa 840 

ggtaagacat tggttttgga 900 
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atctataaat aacaatgaca acaaatatta 
tttaatacag aaaataaata ataggttggg 
atattacaac gaaggaaatt caattgttgc 
gacagttctg attcataatt tgaatacaca 
tcctttaatt aatgaattac caaatatttt 
ttctcaaggc agtccaat^c aattacattt 
gggtggcact ttagcaataa tgcaaaaaac 
taaataatta cagcagtctg agttataaaa 
ttttacatca ggaaaatttt tctgtatatt 
ccaactaaaa tgtagaattc gagctcggta 
caagcttggg cttttcagcc tgatacagat 
acagaatttg cctggcggca gtagcgcggt 
agtgaaacgc cgtagcgccg atggtagtgt 
ccaggcatca aataaaacga aaggctcagt 
gtttgtcggt gaacgctctc ctgagtagga 
cgaagcaacg gcccggaggg tggcgggcag 
ttaagcagaa ggccatcctg acggatggcc 
atttttctaa atacattcaa atatgtatcc 
tcaaacatga gaattaaatc aatctaaagt 
caatgcttaa tcagtgaggc acctatctca 
gcctgactcc ccgtcgtgta gataactacg 
gctgcaatga taccgcgaga cccacgctca 
ccagccggaa gggccgagcg cagaagtggt 
attaattgtt gccgggaagc tagagtaagt 
gttgccattg ctacaggcat cgtggtgtca 
tccggttccc aacgatcaag gcgagttaca 
agctccttcg gtcctccgat cgttgtcaga 
gttatggcag cactgcataa ttctcttact 
actggtgagt actcaaccaa gtcattctga 
tgcccggcgt caacacggga taataccgcg 
attggaaaac gctcttcggg gcgaaaactc 
tcgatgtaac ccactcgtgc acccaactga 
tctgggtgag caaaaacagg aaggcaaaat 
aaatgttgaa tactcatact cttccttttt 
tgtctcatga gcggatacat atttgaatgt 
cgcacatttc cccgaaaagt gccacctggc 
tttggtctgc gcgtaatctc ttgctctgaa 
tcgaaggttc tctgagctac caactctttg 
tcaccaaaac ttgtcctttc agtttagcct 
tctaaatcaa ttaccagtgg ctgctgccag 
ctcaagacga tagttaccgg ataaggcgca 
acagtccagc ttggagcgaa ctgcctaccc 
aacgcggcca taacagcgga atgacaccgg 
cgagggagcc gccaggggaa acgcctggta 
ctgatttgag cgtcagattt cgtgatgctt 
ctttgccgcg gccctctcac ttccctgtta 
ccgccccgtt cgtaagccat ttccgctcgc 
gtgagcgagg aagcggaata tatcctgtat 
ttttttctcc tgccacatga agcacttcac 
ccagtgcggc cgcttctttc ctgcgttatc 
ctttgagtga gctgataccg ctcgccgcag 
cgaggaagcg gaagagcgcc tgatgcggta 
acaccgcata agatcccctc gacctgcagg 
aaactatatg atttacccct aaatctttaa 
ttaaaattta aatatttctt gtgataaagt 
atgtggttat cgattatagg tatgtggttt 
gttattaaaa ataaaaatta gaccacgcat 
aagaaccctt aactaaactt gcagacgaat 
attcgacaag ttttgggatt gttaagggtt 
ataaagaagc gaaaaaggag aagtcggttc 
aatataaaaa ccttcttcaa ctaacggggc 



taaaatcgaa gatatatgct cccaagacgc 960 
cgtcacattg acgtttacag aactaatcca 1020 
tcatgaaatt aaacaattta ttaataaaat 1080 
atttaaccca gacgctattt atattaactg 1140 
aaatgaaatt aaagagcaat tctcctgttt 1200 
aactactaat gtaaaacaag ctactttatt 1260 
attaaatata aataacattc aaatgaatat 1320 
tagatatctc ggaccgtcat aaaaaattta 1380 
agattcaagt tagtttgttt attaaattaa 1440 
cccggggatc ctctagagtc gacctgcagc 1500 
taaatcagaa cgcagaagcg gtctgataaa 1560 
ggtcccacct gaccccatgc cgaactcaga 1620 
ggggtctccc catgcgagag tagggaactg 1680 
cgaaagactg ggcctttcgt tttatctgtt 1740 
caaatccgcc gggagcggat ttgaacgttg 1800 
gacgcccgcc ataaactgcc aggcatcaaa 1860 
tttttgcgtt tctacaaact ctttttgttt 1920 
gctgatcccc atcctatcga tgataagctg 1980 
atatatgagt aaacttggtc tgacagttac 2040 
gcgatctgtc tatttcgttc atccatagtt 2100 
atacgggagg gcttaccatc tggccccagt 2160 
ccggctccag atttatcagc aataaaccag 2220 
cctgcaactt tatccgcctc catccagtct 2280 
agttcgccag ttaatagttt gcgcaacgtt 2340 
cgctcgtcgt ttggtatggc ttcattcagc 2400 
tgatccccca tgttgtgcaa aaaagcggtt 2460 
agtaagttgg ccgcagtgtt atcactcatg 2520 
gtcatgccat ccgtaagatg cttttctgtg 2580 
gaatagtgta tgcggcgacc gagttgctct 2640 
ccacatagca gaactttaaa agtgctcatc 2700 
tcaaggatct taccgctgtt gagatccagt 2760 
tcttcagcat cttttacttt caccagcgtt 2820 
gccgcaaaaa agggaataag ggcgacacgg 2880 
caatattatt gaagcattta tcagggttat 2940 
atttagaaaa ataaacaaat aggggttccg 3000 
ggccgcataa gatgatcttc ttgagatcgt 3060 
aacgaaaaaa ccgccttgca gggcggtttt 3120 
aaccgaggta actggcttgg aggagcgcag 3180 
taaccggcgc atgacttcaa gactaactcc 3240 
tggtgctttt gcatgtcttt ccgggttgga 3300 
gcggtcggac tgaacggggg gttcgtgcat 3360 
ggaactgagt gtcaggcgtg gaatgagaca 3420 
taaaccgaaa ggcaggaaca ggagagcgca 3480 
tctttatagt cctgtcgggt ttcgccacca 3540 
gtcagggggg cggagcctat ggaaaaacgg 3600 
agtatcttcc tggcatcttc caggaaatct 3660 
cgcagtcgaa cgaccgagcg tagcgagtca 3720 
cacatattct gctgacgcac cggtgcagcc 3780 
tgacaccctc atcagtgcca acatagtaag 3840 
ccctgattct gtggataacc gtattaccgc 3900 
ccgaacgacc gagcgcagcg agtcagtgag 3960 
ttttctcctt acgcatctgt gcggtatttc 4020 
catgcaagct tctgtaggtt tttaggcata 4080 
aatgcccctt aaaattcaaa ataaaggcat 414 0 
ttgttaaaaa ggagtggttt tatgactgtt 4200 
tgtattggaa tggcattttt tgctatcaag 4260 
ttatgccgag aaaatttatt gtgcgttgag 4320 
gtcggcatag cgtgagctat taagccgacc 4380 
ccgaggctca acgtcaataa agcaattgga 4440 
agaaaaagaa ggatatggat ctggagctgt 4500 
aggttagtga cattagaaaa ccgactgtaa 4560 
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aaagtacagt cggcattatc tcatattata 
cctgaataga gttcataaac aatcctgcat 
gtaaagatag cggtaaatat attgaattac 
tgggtagaag gtaattacta ttattattga 
atggaataat agaaagagaa aaagcatttt 
ccgaaccatt atatttctct acatcagaaa 
tctttacagg agtccaaata ccagagaatg 
gtggctctaa cttatcccaa taacctaact 
ctgtatttga gtttatcacc cttgtcacta 
cttcttgttt tatgtttcgg tataaaacac 
tcgtttgttg gttcaaataa tgattaaata 
ttttattaaa gttcatttga tatgcctcct 
ttacttgtct gctttcttca ttagaatcaa 
ataaatatat attttaaaaa tatcccactt 
tgctttagtt gaagaataaa gaccacatta 
gatttgagcg tagcgaaaaa tccttttctt 
cggcgaggct agttaccctt aagttattgg 
ctttttcgta cctattaatg tatcgtttta 
ggaaaaagca acttttttta ttgtcatagt 
taacatggaa aagtatactg agaaaaaaca 
acgtcatatt ggagagaatc aaatggattt 
tgtagctgat aaaactttag aaaaacagaa 
attttgtcct gtctgtgctt ggagaaaagc 
gatgcaatat attaagcagc aagagaaaaa 
taatgtaatg agtgatgaat tagaaaatga 
acttataaag agaaaaaaag taggtagtgt 
tacatataat aaaaaaagag atgattataa 
taaatcgtat ttcacagata aaagatatta 
gcgtgatgta acgggcattt cagaaataac 
taataataaa gaattatatg aaatggctaa 
aaataaatca aaaagtcttt gatgcatttt 
tttattcagg attatttaaa gaggctaaaa 
taaaagaaat tgatccaacc gtaatatatc 
gagtatttag ctagtgaact ttatgactta 
aaaatgatag acgaaatcga ggaagaacaa 
caagtttagt taagggttct tctcaacgca 
ctaattttta tttttaataa ccttgatagc 
acctataatc gataaccaca taacagtcat 
caagaaatat tt 



aaagccagtc attaggccta tctgacaatt 4620 
gataaccatc acaaacagaa tgatgtacct 4680 
ctttattaat gaattttcct gctgtaataa 4740 
tatttaagtt aaacccagta aatgaagtcc 4800 
caggtatagg tgttttggga aacaatttcc 4860 
ggtataaatc ataaaactct ttgaagtcat 4920 
ttttagatac accatcaaaa attgtataaa 4 980 
ctccgtcgct attgtaacca gttctaaaag 5040 
agaaaataaa tgcagggtaa aatttatatc 5100 
taatatcaat ttctgtggtt atactaaaag 5160 
tctcttttct cttccaattg tctaaatcaa 5220 
aaatttttat ctaaagtgaa tttaggaggc 5280 
tcctttttta aaagtcaata ttactgtaac 5340 
tatccaattt tcgtttgttg aactaatggg 5400 
aaaaatgtgg tcttttgtgt ttttttaaag 5460 
tcttatcttg ataataaggg taactattgc 5520 
tatgactggt tttaagcgca aaaaaagttg 5580 
aatgactagt aaaaaacata catagaaagg 5640 
ttgtgaaaac taagttgttt ttatgtgtta 5700 
aagaaatcaa gtatttcaga aatttattaa 5760 
agttgaagat tgcaatacat ttctgtcttt 5820 
attatataaa gctaattctt gtaaaaatcg 5880 
tagaaaagat gcattgggtt tatctttgat 5940 
ggagtttatc tttttaactt tgactacacc 6000 
aataaaacgt tataataatt cttttagaaa 6060 
tataaaggga tatgttcgta agttagagat 6120 
tcctcatttt catgtgttaa ttgcagtaaa 6180 
tattagccaa caagaatggt tagatttatg 6240 
acaagttcaa gttcaaaaaa taagacaaaa 6300 
gtattctggt aaagatagtg attatttaat 6360 
ataaatcact taaaggtaaa caggtattag 6420 
agaaattaaa aaatggggat ttagattact 6480 
tatcaaattt tttatatttg gaaacaaaaa 6540 
acagaacaag aaaaaagaga aattaatcac 6600 
taacaaaata taagtgctaa cagtcgtctg 6660 
caataaattt tctcggcata aatgcgtggt 6720 
aaaaaatgcc attccaatac aaaaccacat 6780 
aaaaccactc ctttttaaca aactttatca 6840 

6852 



<210> 54 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 0R1 operator sequence 
<400> 54 

tatcaccgca gaggta 16 

<210> 55 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> OR2 operator sequence 
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<400> 55 

taacaccgtg cgtgttg 



17 



<210> 56 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 0R3 operator sequence 
<400> 56 

tatcaccgca agggata 17 

<210> 57 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> OL1 operator sequence 



<210> 58 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> OL2 operator sequence 
<400> 58 

tatctctggc ggtgttg 17 

<210> 59 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> OL3 operator sequence 



<210> 60 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MalM operator sequence 
<400> 60 

cgcaaacgtt tgcgt 15 

<210> 61 
<211> 14 
<212> DNA 



<400> 57 

tatcaccgcc agtggta 



17 



<400> 59 

tatcaccgca gatggtt 



17 
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<213> Artificial Sequence 



<220> 

<223> MalX operator sequence 



<400> 61 
cgcaaacgtt ttgg 



14 



<210> 62 
<211> 38 
<212> DNA 

<213> Escherichia coli 
<400> 62 

actttatcac tgataaacaa acttatcagt gataaaga 38 

<210> 63 
<211> 38 
<212> DNA 

<213> Escherichia coli 



<210> 64 
<211> 38 
<212> DNA 

<213> Plasmid pSClOl 
<400> 64 

agcttatcat cgataagcta gtttatcaca gttaaatt 38 

<210> 65 

<211> 39 

<212> DNA 

<213> Salmonella 



<210> 66 
<211> 38 
<212> DNA 

<213> Transposon TnlO 
<400> 66 

aatctatcac tgatagagta ccctatcatc gatagaga 38 

<210> 67 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Trp O operator sequence 



<400> 63 

actctatcat tgatagagtt ccctatcagt gatagaga 



38 



<400> 65 

actctatcat tgataaggga actctatcaa tgataggga 



39 



<400> 67 

cgaactagtt aactagtacg 



20 



<210> 68 
<211> 29 
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<212> DNA 

<213> Artificial Sequence 
<2Z0> 

<223> Xyl O operator sequence 



<400> 68 

agttagtttg tttattaaat taaccaact 



29 
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